Europäisches Patentamt
European Patent Office
Office européen des brevets

(11) Publication number: 0 459 630 A2



EUROPEAN PATENT APPLICATION



(21) Application number: 91303868.3
(22) Date of filing: 29.04.91



(51) Int. Cl.5: C12N 15/27, C12P 21/02,
C07K 15/00, A61K 37/02



(30) Priority: 30.04.90 GB 9009623
20.06.90 GB 9013773
24.07.90 GB 9016215
11.02.91 GB 9102799



(43) Date of publication of application:
04.12.91 Bulletin 91/49

(84) Designated Contracting States:
AT BE CH DE DK ES FR GB GR IT LI LU NL SE



(71) Applicant: IMPERIAL CHEMICAL INDUSTRIES PLC
Imperial Chemical House, Millbank
London SW1P 3JF (GB)



(72) Inventor: Camble, Roger
8 Yew Tree Close
Macclesfield, Cheshire SK11 8NY (GB)

Inventor: Wilkinson, Anthony James
4 Ravenho Lane
Macclesfield, Cheshire (GB)

Inventor: Carr, Heather
16 Princess Street
Bollington, Cheshire (GB)

Inventor: Timms, David
45 Beech Lane
Macclesfield, Cheshire (GB)

(74) Representative: Mack, John Richard et al
Imperial Chemical Industries PLC, Legal Department: Patents, PO Box 6, Bessemer Road
Welwyn Garden City, Hertfordshire AL7 1HD (GB)



(54) Polypeptides.

(57) Derivatives of naturally occurring G-CSF having at least one of the biological properties of naturally
occurring G-CSF and a solution stability of at least 35% at 5 mg/ml are disclosed, in which the derivative
has at least Cys 17 of the native sequence replaced by a Ser 17 residue and Asp 27 of the native sequence
replaced by a Ser 27 residue.

Nucleotide sequences coding for part or all of the amino acid sequence of the derivatives of the
invention may be incorporated into autonomously replicating plasmid or viral vectors employed to
transform or transfect suitable procaryotic or eucaryotic host cells such as bacteria, yeast or vertebrate
cells in culture.

The present invention relates to derivatives of granulocyte colony stimulating factor (G-CSF) having good
solution stability and to processes for their preparation as well as to pharmaceutical compositions containing
them.

The colony stimulating factors are a class of protein hormones which stimulate the proliferation and the
function of specific blood cell types such as granulocytes. Granulocytes engulf and devour microbial invaders
and cell debris and thus represent a vital factor in response to infection. In this regard granulocytes can extend
pseudopods and slip out of the vascular tree between the lining endothelial cells. The neutrophilic granulocytes
can then come into direct contact with the microorganisms and destroy them using unique enzyme systems
such as those which generate superoxide anions. Since granulocytes have only a short life span in the circulation
(approximately 6-12 hours) and are destroyed in the course of their function, it is necessary for the stem
cells of the bone marrow to generate as many granulocytes as red blood cells each day. Further, this rate of
production of granulocytes needs to increase enormously if the demands of infection are to be met. As a result
of their fast turnover, the granulocyte count falls rapidly if the bone marrow is damaged, for example by cancer
chemotherapy, radiation, AIDS or haematological disorders, and patients become liable to overwhelming infection.
Indeed sepsis is a common cause of death in cancer patients whose marrow is suppressed by radiation
treatment, chemotherapy or their neoplastic disease.

Granulocyte colony stimulating factor (G-CSF) has been described in the literature by Welte K. et al, Proc.
Natl. Acad. Sci. USA Vol 82, pp 1526-1530 and has also been described in European Patent Publication No
169,566 and PCT Patent Publication No WO 87/01132. G-CSF has been shown to stimulate granulocyte production
in vivo and to function with minimal side effects. As a result human G-CSF is seen as having potential
utility in the management of neutropaenia associated with chemotherapy, radiation therapy, radiation accident
or autologous bone marrow transplantation. Moreover G-CSF may have utility in the stimulation of bone marrow
suppression associated with AIDS, in the treatment of myelodysplastic syndromes characterised by granulocyte
functional abnormalities and as an adjunct to the treatment of severe infections.
In addition to the above, certain analogues of G-CSF have been described in PCT Patent Publication No
WO 87/01132, in European Patent Publication No 243,153, in European Patent Publication No 256,843, in
European Patent Publication No 272,703 and in Biochemical and Biophysical Research Communications [1989]
Vol. 159, No 1, pp 103-111, Kuga T. et al. Furthermore, modification of G-CSF and [Ser17]G-CSF has been effected
by substituting the cysteine and serine residues at position 17, but such changes failed to achieve the desired
effect (Protein Engineering, Vol 3, No. 4, page 360 (1990)).

G-CSF and the analogues referred to above tend to suffer from solution instability in that on standing they 
tend to precipitate out of solution thus resulting in short shelf life and problems in storage at high concentrations. 
Moreover G-CSF and certain of the analogues referred to above have a tendency to covalent aggregation on 
storage. 

The present invention is based on the discovery of modifications that may be made to a G-CSF or a derivative
thereof having part or all of the amino acid sequence and at least one of the biological properties of naturally
occurring G-CSF, for example of naturally occurring human G-CSF, whereby to improve solution stability.

Thus according to one feature of the present invention there is provided a derivative of naturally occurring
G-CSF having at least one of the biological properties of naturally occurring G-CSF and a solution stability (as
herein defined) of at least 35% at 5 mg/ml, the said derivative having at least Cys 17 of the native sequence replaced
by a Ser 17 residue and Asp 27 of the native sequence replaced by a Ser 27 residue.

The derivatives of the present invention may conveniently have at least one further modification selected
from:-

a) Gln 11 of the native sequence replaced by an Arg 11 residue;
b) Leu 15 of the native sequence replaced by a Glu 15 residue;
c) Lys 23 of the native sequence replaced by an Arg 23 residue;
d) Gly 26 of the native sequence replaced by an Ala 26 residue;
e) Gly 28 of the native sequence replaced by an Ala 28 residue;
f) Ala 30 of the native sequence replaced by a Lys 30 or Arg 30 residue;
g) Lys 34 of the native sequence replaced by an Arg 34 residue;
h) Lys 40 of the native sequence replaced by an Arg 40 residue;
i) Pro 44 of the native sequence replaced by an Ala 44 residue;
j) Leu 49 of the native sequence replaced by a Lys 49 residue;
k) Gly 51 of the native sequence replaced by an Ala 51 residue;
l) Gly 55 of the native sequence replaced by an Ala 55 residue;
m) Trp 58 of the native sequence replaced by a Lys 58 residue;
n) Pro 60 of the native sequence replaced by a Ser 60 residue;
o) Pro 65 of the native sequence replaced by a Ser 65 residue;
p) Ala 111 of the native sequence replaced by a Glu 111 residue;
q) Thr 115 of the native sequence replaced by a Ser 115 residue;
r) Thr 116 of the native sequence replaced by a Ser 116 residue; and
s) Tyr 165 of the native sequence replaced by an Arg 165 residue.
The presence of at least one further modification selected from (b) to (s) is preferred, but the presence of
at least one further modification selected from (b), (d), (e), (f), (n) and (o) is particularly preferred, of which further
modification (o) is especially preferred.

More preferably the further modification comprises at least one of the following:-

i) Gln 11, Pro 60,65 of the native sequence replaced by Arg 11, Ser 60,65;
ii) Ala 111, Thr 115,116 of the native sequence replaced by Glu 111, Ser 115,116;
iii) Gln 11, Trp 58, Tyr 165 of the native sequence replaced by Arg 11,165, Lys 58;
iv) Leu 15, Gly 26,28, Ala 30 of the native sequence replaced by Glu 15, Ala 26,28, Lys 30; or
v) Pro 44, Leu 49, Gly 51,55, Trp 58 of the native sequence replaced by Lys 49,58, Ala 44,51,55.

The further modification may also, preferably, comprise at least one of the following:-

vi) Leu 15, Gly 26,28, Ala 30 of the native sequence replaced by Glu 15, Ala 26,28, Arg 30; or
vii) Pro 65 of the native sequence replaced by Ser 65; or
viii) Pro 60,65 of the native sequence replaced by Ser 60,65; or
ix) Gln 11, Pro 65 of the native sequence replaced by Arg 11, Ser 65.
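
The bracketed names used later for individual derivatives can be built from the lettered modifications above. The following is a minimal sketch, not part of the specification: the table layout, the key "f_arg" (the Arg 30 alternative of modification f) and the function name are illustrative assumptions, and positions 11 and 111 are read as Gln and Ala respectively, consistent with combinations (i) and (ii).

```python
from collections import defaultdict

# The two substitutions present in every derivative: (position, native, replacement).
BASE = [(17, "Cys", "Ser"), (27, "Asp", "Ser")]

# Further modifications (a) to (s); "f_arg" is a hypothetical key for the
# Arg 30 alternative of modification (f).
FURTHER = {
    "a": (11, "Gln", "Arg"), "b": (15, "Leu", "Glu"), "c": (23, "Lys", "Arg"),
    "d": (26, "Gly", "Ala"), "e": (28, "Gly", "Ala"), "f": (30, "Ala", "Lys"),
    "f_arg": (30, "Ala", "Arg"), "g": (34, "Lys", "Arg"), "h": (40, "Lys", "Arg"),
    "i": (44, "Pro", "Ala"), "j": (49, "Leu", "Lys"), "k": (51, "Gly", "Ala"),
    "l": (55, "Gly", "Ala"), "m": (58, "Trp", "Lys"), "n": (60, "Pro", "Ser"),
    "o": (65, "Pro", "Ser"), "p": (111, "Ala", "Glu"), "q": (115, "Thr", "Ser"),
    "r": (116, "Thr", "Ser"), "s": (165, "Tyr", "Arg"),
}


def derivative_name(keys):
    """Build a name such as '[Arg11, Ser17,27,60,65]G-CSF' from modification keys."""
    positions_by_residue = defaultdict(list)
    seen = set()
    for pos, _native, new in BASE + [FURTHER[k] for k in keys]:
        if pos in seen:
            raise ValueError(f"position {pos} is modified more than once")
        seen.add(pos)
        positions_by_residue[new].append(pos)
    # Order residue groups by their lowest position, as in the names in the text.
    groups = sorted(positions_by_residue.items(), key=lambda item: min(item[1]))
    parts = [res + ",".join(str(p) for p in sorted(ps)) for res, ps in groups]
    return "[" + ", ".join(parts) + "]G-CSF"


print(derivative_name(["a", "n", "o"]))       # combination (i)
print(derivative_name(["b", "d", "e", "f"]))  # combination (iv)
```

Grouping by replacement residue and ordering groups by lowest position is only a convention inferred from the names in the text.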

The above defined modifications may thus, if desired, be introduced into any polypeptide having at least
one of the biological properties of naturally occurring G-CSF in order to improve the solution stability of the
molecule. The modifications of the present invention may thus be applied to such polypeptides which differ in
amino acid sequence from that specified herein for the naturally occurring G-CSFs in terms of the identity or
location of one or more residues (for example substitutions, terminal and internal additions and deletions). As
examples such polypeptides might include those which are foreshortened, for example by deletions; or those
which are more stable to hydrolysis (and, therefore, may have more pronounced or longer lasting effects than
naturally occurring); or which have been altered to delete one or more potential sites for O-glycosylation (which
may result in higher activities for yeast-produced products); or which have one or more cysteine residues
deleted or replaced, for example by alanine or serine residues, and are potentially more easily isolated in active
form from microbial systems; or which have one or more tyrosine residues replaced by phenylalanine and may
bind more or less readily to human G-CSF receptors on target cells. The proposed modifications (a) to (s), preferably
(i) to (ix), may thus, for example, be applied to either native G-CSF having Cys 17 of the native sequence
replaced by Ser 17 or to allelic variants and analogues thereof known to possess at least one of the biological
properties of naturally occurring G-CSF such as those described in the publications referred to above.

Polypeptides of the present invention that have been tested have been found to possess improved solution
stability over the corresponding unmodified polypeptide whilst either retaining significant biological activity or
even having improved biological activity.

It will be understood from the above that the property of solution stability is different from that of solubility.
Solution stability is the decreased tendency of a substance to precipitate from solution under physiological conditions
of pH, temperature and ionic strength.

Solution stability is measured herein by determining the percentage of G-CSF derivative remaining in solution
in phosphate buffered saline after 14 days at 37°C given an initial concentration of 1 mg/ml, 5 mg/ml and/or
10 mg/ml. Measurement of solution stability is described in detail hereinafter in Reference Example 4. Conveniently
polypeptides of the present invention will have a solution stability at 5 mg/ml of at least 35%, advantageously
at least 50% and preferably at least 75%. Preferably the polypeptides of the present invention will
have a solution stability at 10 mg/ml of at least 75%, especially at least 85%.
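
The percentage defined above can be illustrated with a short calculation. This is a sketch only: the function names and the example measurement are hypothetical, and the assay itself (Reference Example 4) is not reproduced here.

```python
def solution_stability(initial_mg_per_ml, remaining_mg_per_ml):
    """Percentage of derivative still in solution at the end of the incubation."""
    if initial_mg_per_ml <= 0:
        raise ValueError("initial concentration must be positive")
    return 100.0 * remaining_mg_per_ml / initial_mg_per_ml


def grade_at_5_mg_per_ml(percent):
    """Classify a 5 mg/ml result against the thresholds stated in the text."""
    if percent >= 75:
        return "preferred (at least 75%)"
    if percent >= 50:
        return "advantageous (at least 50%)"
    if percent >= 35:
        return "meets minimum (at least 35%)"
    return "below minimum"


# Hypothetical measurement: 5 mg/ml initially, 2.1 mg/ml still in solution.
result = solution_stability(5.0, 2.1)
print(round(result, 1), grade_at_5_mg_per_ml(result))
```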

The expression "naturally occurring G-CSF" as used herein refers to those G-CSFs that have been found
to exist in nature and includes the two polypeptides having the amino acid sequence set out in SEQ ID No 37.
These two polypeptides differ only in so far as a tripeptide insert Val-Ser-Glu is present in one polypeptide between
positions 35 and 36, but absent in the other. The numbering system used throughout the present specification
is based on the naturally occurring polypeptide without the Val-Ser-Glu insert and the term "native" as
used herein refers to this polypeptide without the Val-Ser-Glu insert. It will be appreciated that the present invention
is applicable to all naturally occurring forms of G-CSF and analogues thereof as described above and consequential
revision of the position numbers of the polypeptide may be necessary depending on the form of
naturally occurring G-CSF selected for modification.
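
The consequential revision of position numbers mentioned above amounts to shifting every position after 35 by the length of the insert. A minimal sketch, with hypothetical function names:

```python
INSERT_AFTER = 35                  # the insert lies between native positions 35 and 36
INSERT = ("Val", "Ser", "Glu")     # tripeptide present in the longer natural form


def native_to_long(pos):
    """Map a native (insert-free) position to the form carrying Val-Ser-Glu."""
    if pos < 1:
        raise ValueError("positions are 1-based")
    return pos if pos <= INSERT_AFTER else pos + len(INSERT)


def long_to_native(pos):
    """Inverse mapping; returns None for the three inserted residues."""
    if pos < 1:
        raise ValueError("positions are 1-based")
    if pos <= INSERT_AFTER:
        return pos
    if pos <= INSERT_AFTER + len(INSERT):
        return None
    return pos - len(INSERT)


print(native_to_long(17), native_to_long(65))
```

Under this mapping Cys 17 keeps its number in either form, whereas native Pro 65 corresponds to position 68 in the form carrying the insert.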

According to a further feature of the present invention there is provided a DNA sequence encoding all or
part of the amino acid sequence of a derivative of naturally occurring G-CSF as hereinbefore defined. Such
sequences may, for example, include 1) the incorporation of codons preferred for expression by selected non-mammalian
hosts; 2) the provision of sites for cleavage by restriction endonucleases; and/or 3) the provision
of additional initial, terminal or intermediate DNA sequences which facilitate construction of readily expressed
vectors. The DNA sequences of the present invention include those useful in securing expression in procaryotic
or eucaryotic host cells and the derivatives of the present invention may be in either glycosylated or non-glycosylated
form depending upon the host cell selected. Where the derivative of the present invention is
obtained in non-glycosylated form, for example following expression in procaryotic host cells, the derivative
may, if desired, be glycosylated chemically, for example with mammalian or other eucaryotic carbohydrates.

According to a further feature of the present invention there is provided a recombinant vector containing a 
DNA sequence as hereinbefore defined. The recombinant vector may for example be a biologically functional 
plasmid or viral DNA vector. 

According to a further feature of the present invention there is provided a process for the preparation of a
recombinant vector as hereinbefore defined which comprises inserting a DNA sequence as hereinbefore
defined into a vector.

According to a further feature of the present invention there is provided a procaryotic or eucaryotic host
cell stably transformed or transfected with a recombinant vector as hereinbefore defined.

According to a further feature of the present invention there is provided a process for the preparation of a
procaryotic or eucaryotic host cell as hereinbefore defined which comprises transforming or transfecting a procaryotic
or eucaryotic cell with a recombinant vector as hereinbefore defined whereby to yield a stably transformed
or transfected procaryotic or eucaryotic host.

According to a further feature of the present invention there is provided a process for the preparation of a
derivative of naturally occurring G-CSF of the present invention which comprises culturing a procaryotic or
eucaryotic host cell of the invention whereby to obtain said derivative. The process will advantageously also
include the step of isolating the said derivative produced by expression of the DNA sequence of the invention
in the recombinant vector of the invention.

The host cells for use in processes of the present invention are preferably procaryotic such as E. coli, but
may be yeast cells such as Saccharomyces cerevisiae or mammalian cells such as CHO cells (Chinese hamster
ovary cells).

According to a further feature of the present invention there is provided a pharmaceutical composition comprising
as active ingredient at least one derivative of naturally occurring G-CSF of the present invention in
association with a pharmaceutically acceptable carrier or excipient.

According to a further feature of the present invention there is provided a method for providing
haematopoietic therapy to a mammal which comprises administering an effective amount of a derivative of the
present invention.

According to a further feature of the present invention there is provided a method for arresting the proliferation
of leukaemic cells which comprises administering an effective amount of a derivative of the present
invention.

Brief Description of the Drawings 

Figure 1 shows the nucleotide sequence of the 167 bp fragment referred to in Example 1;
Figure 2 shows the amino acid sequence and corresponding nucleotide sequence of native human (hu)
G-CSF and restriction sites;
Figure 3 shows the amino acid sequence and corresponding nucleotide sequence of [Ser17,27] hu G-CSF
and restriction sites;
Figure 4 shows the nucleotide sequence of the T4 transcription terminator having (a) terminal SalI and HindIII
restriction sites; and (b) terminal SalI and StyI restriction sites;
Figure 5 shows a restriction map of pTB357 (also referred to herein as pLB004);
Figure 6 shows the nucleotide sequence of the EcoRI-SalI fragment referred to in Reference Example 6(b)
but omitting the interferon gene sequence;
Figure 7 shows a restriction map of pLB015 (also referred to herein as pICI 0080);
Figure 8 shows a restriction map of pICI 1079;
Figure 9 shows a restriction map of pICI 54 (also referred to herein as pCG54);
Figure 10 shows a restriction map of pCG61;
Figure 11 shows a restriction map of pICI 1107 in which the shaded area represents the gene sequence
coding for [Ser17,27]hu G-CSF;
Figure 12 shows a restriction map of pCG300 (also referred to herein as pICI 1295).

Detailed Description

Advantageously the derivatives of the present invention are selected to possess one of the further modifications
(i), (ii), (iii), (iv), (v), (vi), (vii), (viii) or (ix) as hereinbefore defined, preferably one of the further modifications
(i), (ii), (iv), (vi), (vii), (viii) or (ix) and especially further modification (ii), (iv), (vi), (vii), (viii) or (ix).

Particularly preferred derivatives according to the present invention by virtue of their good solution stability 
include:- 

[Arg11, Ser17,27,60,65]G-CSF;
[Glu15, Ser17,27, Ala26,28, Lys30]G-CSF;
[Arg11, Glu15, Ser17,27,60,65, Ala26,28, Lys30]G-CSF;
[Arg11,23, Ser17,27,60,65]G-CSF;
[Arg11,34, Ser17,27,60,65]G-CSF;
[Arg11,40, Ser17,27,60,65]G-CSF;
[Ala1, Thr3, Tyr4, Arg5,11, Ser17,27,60,65]G-CSF;
[Arg11, Glu15,111, Ser17,27,60,65,115,116, Ala26,28, Lys30]G-CSF;
[Arg11,165, Glu15, Ser17,27,60,65, Ala26,28, Lys30,58]G-CSF;
[Arg11, Glu15, Ser17,27,60,65, Ala26,28,44,51,55, Lys30,49,58]G-CSF;
[Arg11,165, Glu15,111, Ser17,27,60,65,115,116, Ala26,28,44,51,55, Lys30,49,58]G-CSF; and
[Glu15, Ser17,27, Ala26,28, Arg30]G-CSF.
Especially preferred derivatives of the invention by virtue of their excellent solution stability and good specific
activity include:-

i) [Arg11, Ser17,27,60,65]G-CSF,
ii) [Glu15, Ser17,27, Ala26,28, Lys30]G-CSF,
iii) [Arg11, Glu15, Ser17,27,60,65, Ala26,28, Lys30]G-CSF,
iv) [Arg11,40, Ser17,27,60,65]G-CSF,
v) [Arg11,23, Ser17,27,60,65]G-CSF,
vi) [Arg11,165, Glu15, Ser17,27,60,65, Ala26,28, Lys30,58]G-CSF,
vii) [Arg11, Glu15,111, Ser17,27,60,65,115,116, Ala26,28, Lys30]G-CSF,
viii) [Glu15, Ser17,27, Ala26,28, Arg30]G-CSF,
ix) [Ala1, Thr3, Tyr4, Arg5,11, Ser17,27,60,65]G-CSF,
x) [Ser17,27,60,65]G-CSF,
xi) [Arg11, Ser17,27,65]G-CSF, and
xii) [Ser17,27,65]G-CSF

of which (i), (ii), (vi), (vii), (viii) and (xii) are most preferred.

These latter human G-CSF derivatives show not only excellent solution stability properties, but also pos- 
sess improved specific activity over naturally occurring human G-CSF. 

A presequence methionine may be either present or absent in the polypeptides of the present invention
but is conveniently present.

It has been found advantageous to employ a production vector based on pAT153, comprising:-

i) a promoter and where appropriate an operator therefor, for example a trp promoter or a T7A3 promoter.
The T7A3 promoter is the A3 promoter of bacteriophage T7 [see Dunn J.J. and Studier F.W., J. Mol. Biol.
166, 477-535 (1983)]. The complete nucleotide sequence of bacteriophage T7 DNA and the locations of
T7 genetic elements are set out in this reference;
ii) a ribosome binding site sequence, for example a trp leader ribosome binding site sequence;
iii) a cloning site for the gene to be expressed;
iv) a T4 transcription termination sequence (see SEQ ID No. 51 and Figure 4);
v) a cer sequence (Summers D. et al, MGG, 201, p334-338, 1985);
vi) a tetracycline repressor gene (Tet R);
vii) a tetracycline resistance gene (Tet A);
viii) multiple restriction enzyme recognition sequences.

SEQ ID No. 50 sets out a sequence which includes an EcoRI restriction endonuclease site (nucleotides
1-6), the A3 promoter sequence (nucleotides 7-52), the trp leader ribosome binding site sequence (nucleotides
53-78) and the translation initiation codon (nucleotides 79-81).
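
The feature coordinates given for SEQ ID No. 50 are 1-based and inclusive, so they need shifting before they can be used to slice a sequence string. The sketch below records the stated coordinates and checks that the four features tile nucleotides 1-81 without gaps. The sequence itself is not reproduced, so a placeholder string of N characters stands in for it, and all names are illustrative.

```python
# Feature coordinates as stated in the text (1-based, inclusive).
FEATURES = [
    ("EcoRI site", 1, 6),
    ("A3 promoter", 7, 52),
    ("trp leader ribosome binding site", 53, 78),
    ("translation initiation codon", 79, 81),
]


def to_slice(start, end):
    """Convert 1-based inclusive coordinates to a 0-based Python slice."""
    return slice(start - 1, end)


def feature_lengths(features):
    """Length in nucleotides of each feature."""
    return {name: end - start + 1 for name, start, end in features}


def is_contiguous(features):
    """True if each feature starts immediately after the previous one ends."""
    return all(nxt[1] == cur[2] + 1 for cur, nxt in zip(features, features[1:]))


# Placeholder only: the real sequence is not given in this part of the text.
placeholder = "N" * 81
lengths = feature_lengths(FEATURES)
promoter = placeholder[to_slice(7, 52)]
print(lengths, is_contiguous(FEATURES), len(promoter))
```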

It may be advantageous to cultivate the host capable of expressing a derivative of the invention in a growth
medium and to add a supplement which includes yeast extract to the growth medium during cultivation. It is
preferable that addition of the supplement which includes yeast extract is initiated at a predetermined time after
the start of cultivation. The rate of addition of the supplement which comprises yeast extract is preferably such
that the growth medium does not become exhausted of yeast extract. This is particularly advantageous where
the production vector is used with a T7A3 promoter.

It may also be advantageous to cultivate a host, transformed with a recombinant vector carrying genetic
material coding for a derivative of the present invention, in the presence of leucine and/or threonine in an amount
sufficient to give improved accumulation of the derivative of the present invention. Thus it is particularly advantageous
to effect the fermentation in the presence of leucine where the production vector is used with the trp
promoter.

In addition to the discovery of modifications that may be made to a G-CSF or derivative thereof having part
or all of the amino acid sequence and at least one of the biological properties of naturally occurring G-CSF, to
improve solution stability, the present invention is further based on the discovery of modified techniques for
the purification of such G-CSFs and derivatives thereof.

Thus for example there is no disclosure in PCT Patent Publication No WO 87/01132 of the removal of detergent,
particularly N-lauroyl sarcosine in salt form (eg. Sarkosyl), from the G-CSF analogues prepared in this
PCT Publication. It was therefore necessary to identify such a technique in order that the solution stability of
the G-CSF derivatives of the present invention could be assessed at high concentration and in order that formulation
studies could be conducted. In one embodiment of the invention detergent removal was effected in
the presence of a phosphate buffered saline (pH 7.2 - 7.5). The phosphate buffered saline may conveniently
be prepared from isotonic saline and may thus for example have a composition as described in Example 1. In
this regard it was found that other buffers were less preferred since either detergent removal, particularly N-lauroyl
sarcosine (in salt form) removal, was slower or more protein precipitated out. It is further preferred to
effect diafiltration, preferably at this stage, since this was found to improve efficiency without provoking
increased protein precipitation. For example diafiltration was found to be preferable to conventional diffusion
dialysis. Furthermore it was found that detergent concentration, particularly N-lauroyl sarcosine in salt form (eg.
Sarkosyl) concentration, could be reduced below 1% whilst retaining resolution during chromatography. A
reduction in initial detergent concentration assists detergent removal and thus it is preferred to use the minimum
concentration of detergent, for example N-lauroyl sarcosine (in salt form, eg. Sarkosyl), consistent with retaining
resolution during chromatography. A particular concentration of detergent, for example N-lauroyl sarcosine (in
salt form) eg. Sarkosyl, is thus from 0.8% to 0.2%, preferably from 0.5 to 0.2%, especially about 0.3%.

In addition to the above it was found that the removal of detergent such as N-lauroyl sarcosine (in salt form),
e.g. Sarkosyl, activates a trace of proteolytic activity which may complicate product evaluation. It has further
been found that this proteolytic activity may be significantly reduced and even eliminated if, after detergent
removal by diafiltration, the pH is reduced to below 7.0 before substantial proteolysis, conveniently by diafiltration
and preferably by dialysis. Thus in a further embodiment of the present invention the reduction or removal
of trace proteolytic activity may be effected at a pH that is below 7.0 but which is sufficiently high to avoid significant
hydrolysis of the polypeptide. The pH is advantageously in the range 6.0 to 4.5, preferably 5.8 to 5.0,
especially about 5.4. A further advantage of this embodiment of the invention is that E. coli contaminants and/or
degraded or incorrectly folded protein can be precipitated by effecting this lowering of pH. It is preferred that
purification include the step of size exclusion chromatography since otherwise the problem of proteolytic degradation
is increased and whilst the present embodiment will reduce such degradation it makes it difficult to
eliminate.

In addition to the above processes, the introduction of solution stability into a G-CSF or derivative thereof
enables substantial simplification of the process of extraction. Thus according to a further feature of the present
invention there is provided a process for extracting an active derivative of the invention (as hereinbefore
defined) from an inclusion body thereof which comprises 1) suspending said inclusion body in a detergent, particularly
N-lauroyl sarcosine in salt form (e.g. Sarkosyl), 2) oxidation, 3) removal of detergent, for example as
hereinbefore described, and 4) maintaining the solution obtained following removal of detergent at an elevated
temperature, for example 30-45°C, advantageously 34-42°C, whereby to precipitate contaminating bacterial protein,
product oligomers and/or degradation products. The said solution is conveniently maintained at said elevated
temperature for from 6-24 hours, advantageously 8-18 hours, preferably 10-14 hours, especially about 12
hours.
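
The nested temperature and time ranges of step 4) can be written down as a small lookup for checking a planned heat treatment. This is an illustrative sketch only: the tier labels follow the wording of the text, while the function name and the example values are hypothetical.

```python
# Ranges from the text, narrowest first: (label, low, high), both ends inclusive.
TEMPERATURE_C = [("advantageous", 34, 42), ("disclosed", 30, 45)]
DURATION_H = [("preferred", 10, 14), ("advantageous", 8, 18), ("disclosed", 6, 24)]


def classify(value, tiers):
    """Return the label of the narrowest range containing value, or None."""
    for label, low, high in tiers:
        if low <= value <= high:
            return label
    return None


# Hypothetical plan: 37 degrees C for 12 hours.
print(classify(37, TEMPERATURE_C), classify(12, DURATION_H))
```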

The extraction process of the present invention may for example be effected by lysing host cells followed
by centrifugation to obtain the inclusion body, for example in the form of a pellet. The inclusion body may then
be suspended in a detergent such as, for example, N-lauroyl sarcosine in salt form (eg. Sarkosyl), preferably
1-3%, especially about 2% N-lauroyl sarcosine in salt form (eg. Sarkosyl). Suspension in detergent may be followed
by oxidation, for example in the presence of copper sulphate (CuSO4), which in turn may be followed by
centrifugation.

Where it is possible to wash the inclusion body it is preferred to use urea rather than, for example, deoxycholate.

The extraction process of the present invention enables the production process to be simplified, for example
by elimination of the need for the use of size exclusion columns. Moreover the high recovery of product from
the heat treatment step appears to be one of the advantages of the increased solution stability of the derivatives
of the present invention. Indeed the greater the solution stability the more suited is the protein to the new extraction
process. Thus for example it is preferred to apply this extraction process to the extraction of derivatives
of the present invention having a solution stability of at least 85% at 10 mg/ml. When the known analogue
[Met-1, Ser17] G-CSF was extracted by the above process, rpHPLC indicated that only 40% of the desired product
remained in solution after heat treatment of a retentate containing 1 mg/ml total protein. At 3 mg/ml total
protein, only 19% of the analogue remained in solution.

All nucleotide sequences referred to herein are specified in the conventional 5' - 3' sense.
The derivatives of the present invention are based on human G-CSF which is also referred to as hu G-CSF.
Since the derivatives prepared in the Examples are all prepared using E. coli, a presequence methionine will
generally be present.

The following materials are referred to hereinafter in the Reference Examples and Examples and their constitution
is as follows:-

The term "N-lauroyl sarcosine" as used herein refers to the use of the said substance in salt form. Thus in
the Examples N-lauroyl sarcosine is used in the form of the sodium salt.

BUFFERS FOR RESTRICTION ENZYMES

Stability: stable at -20°C.
Buffer composition:

Buffer components          Final concentration in mmol/l
                           (1:10 diluted set buffer)
                           A      B      L      M      H

Tris acetate               33     -      -      -      -
Tris-HCl                   -      10     10     10     50
Mg-acetate                 10     -      -      -      -
MgCl2                      -      5      10     10     10
K-acetate                  66     -      -      -      -
NaCl                       -      100    -      50     100
Dithioerythritol (DTE)     -      -      1      1      1
Dithiothreitol (DTT)       0.5    -      -      -      -
2-Mercaptoethanol          -      1      -      -      -

pH at 37°C                 7.9    8.0    7.5    7.5    7.5

The above buffers are available from Boehringer Mannheim.

In the site-directed mutagenesis procedure - Reference Example 2

Buffer 1    100 mM Tris HCl pH 8.0
            100 mM NaCl
            20 mM MgCl2

Buffer 2    10 mM Tris HCl pH 8.0
            20 mM NaCl
            1 mM EDTA

Buffer 3    12 mM Tris HCl pH 7.7
            30 mM NaCl
            10 mM MgCl2
            8 mM 2-mercaptoethanol

Buffer 4    60 mM Tris HCl pH 8.0
            90 mM NaCl
            6 mM MgCl2
            10 mM DTT

Nucleotide mix 1    250 µM each of dATP, dGTP, dCTPαS (phosphorothioate derivative of dCTP), dTTP and 1 mM ATP

Nucleotide mix 2    250 µM each of dATP, dGTP, dCTP, dTTP and 350 µM ATP
M9 minimal media

Ammonium chloride                      1 g
Disodium hydrogen orthophosphate       6 g
Potassium dihydrogen orthophosphate    3 g
Sodium chloride                        0.5 g
In distilled water                     1 l

Supplements/75 ml

300 µl    50% glucose
75 µl     1 M MgSO4
75 µl     0.1 M CaCl2
75 µl     4 mg/ml thiamine
75 µl     20% casein amino acids
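
The final concentration of each supplement follows from a simple dilution calculation. The sketch below assumes that the supplement volumes are per 75 ml of medium (the heading is damaged in the source and is read here as "Supplements/75 ml") and ignores the small volume added by the supplements themselves. The names are illustrative.

```python
MEDIUM_ML = 75.0  # assumption: supplement volumes are stated per 75 ml of medium

# (name, stock concentration, unit of the stock, volume added in microlitres)
SUPPLEMENTS = [
    ("glucose", 50.0, "%", 300.0),
    ("MgSO4", 1.0, "M", 75.0),
    ("CaCl2", 0.1, "M", 75.0),
    ("thiamine", 4.0, "mg/ml", 75.0),
    ("casein amino acids", 20.0, "%", 75.0),
]


def final_concentration(stock, added_ul, medium_ml=MEDIUM_ML):
    """Dilution C1*V1 = C2*V2, neglecting the volume of the addition itself."""
    return stock * (added_ul / 1000.0) / medium_ml


final = {
    name: (final_concentration(stock, ul), unit)
    for name, stock, unit, ul in SUPPLEMENTS
}
for name, (value, unit) in final.items():
    print(f"{name}: {value:g} {unit}")
```

On these assumptions glucose ends up at about 0.2%, MgSO4 at about 1 mM and CaCl2 at about 0.1 mM.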

Trace Element Solution (TES)

TES has the following composition:-

AlCl3.6H2O           0.1 mg l-1        100 µg l-1
CoCl2.6H2O           0.04 mg l-1       40 µg l-1
KCr(SO4)2.12H2O      0.01 mg l-1       10 µg l-1
CuCl2.2H2O           0.01 mg l-1       10 µg l-1
H3BO3                0.005 mg l-1      5 µg l-1
KI                   0.1 mg l-1        100 µg l-1
MnSO4.H2O            0.1 mg l-1        100 µg l-1
NiSO4.6H2O           0.0045 mg l-1     4.5 µg l-1
Na2MoO4.2H2O         0.02 mg l-1       20 µg l-1
ZnSO4.7H2O           0.02 mg l-1       20 µg l-1

and is added to growth media at 0.5 ml/l



Geneclean (TM)

The kit contains 1) 6M sodium iodide; 2) a concentrated solution of sodium chloride, Tris and EDTA for making
a sodium chloride/ethanol/water wash; 3) Glassmilk (TM) - a 1.5 ml vial containing 1.25 ml of a suspension
of silica matrix in water.

This is a technique for DNA purification based on the method of Vogelstein and Gillespie published in Proceedings
of the National Academy of Sciences USA (1979) Vol 76, p 615.
Alternatively any of the methods described in "Molecular Cloning - a Laboratory Manual" Second Edition,
Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory, 1989) can be used.

Random Label Kit Product of Pharmacia No 27-9250

The procedure is described in "Molecular Cloning - a Laboratory Manual" Second Edition, Sambrook, Fritsch
and Maniatis, pp 10.13-10.17 (Published by Cold Spring Harbor Laboratory 1989).

Sequenase (TM)

Chemically modified T7 DNA polymerase

Based on the procedure of Tabor and Richardson published in Proceedings of the National Academy of
Sciences USA (1987) vol 84 pp 4767-4771.

15 

T4 DNAIigase 

Described in "Molecular Cloning - a Laboratory Manual" Second Edition, Sambrook, Fritsch and Maniatis 
5.60-5.64 (Published by Cold Spring Harbor Laboratory 1989) and also by Weiss B. et al J. Biol. Chem. Vol 
20 243 p 4543 (1968). 

The following non-limiting Examples are given by way of illustration only. 

Example 1 

Preparation of [Ser17,27] human G-CSF

The procedure for steps a) and b) in Reference Example 1 was repeated with the following modifications:
Oligonucleotides SEQ ID Nos 24, 25, 26 and 27 (as hereinafter defined) replace SEQ ID Nos 1, 2, 3 and
4 (as hereinafter defined) respectively.

c) Cloning of the gene for [Ser17,27] human G-CSF into an expression vector

The gene described above (see Figure 3 and SEQ ID No. 49) was cloned into plasmid vector pICI0020.
This vector is a pAT153 based plasmid in which the 651 bp EcoRI-AccI region is replaced by a 167 bp
EcoRI-ClaI fragment (SEQ ID No. 47) consisting of:

(1) a synthetic E. coli trp promoter and trp leader ribosome binding site
(2) a translation initiation codon
(3) a multiple restriction enzyme recognition sequence derived from M13mp18, containing sites for KpnI,
BamHI, XbaI, SalI, PstI, SphI and HindIII
(4) a synthetic transcription termination sequence

The DNA sequence of this region is shown in Figure 1.

The pICI0020 expression vector was digested to completion with KpnI (BCL) in 10mM Tris HCl (pH7.5),
10mM magnesium chloride. The DNA was precipitated with ethanol at -20°C from a solution containing 0.3M
sodium acetate and then the 3' sticky ends were removed by treatment with T4 DNA polymerase for 10 minutes
at 37°C as follows:-

DNA (1 µg) in water (16 µl)
10X T4 polymerase buffer (2 µl)
    0.33M Tris acetate pH7.9
    0.1M Magnesium acetate
    0.66M Potassium acetate
    5mM dithiothreitol
    1 mg/ml bovine serum albumin (BSA PENTAX fraction V)
2mM dNTP mixture (1 µl)
T4 DNA polymerase (1 µl; 2.5 units/µl BCL)

Water (80 µl) was added and the mixture extracted with phenol/chloroform (100 µl) and then with
chloroform (100 µl). The DNA was precipitated with ethanol (250 µl) at -20°C after addition of 3M sodium
acetate (10 µl) then digested to completion with SalI (BCL) in 150mM NaCl, 10mM MgCl2 and 10mM Tris HCl
(pH7.5). The KpnI-blunt ended to SalI vector was purified from a 0.7% agarose gel and isolated by use of
Geneclean (trademark) following the manufacturer's (Bio101, USA) recommended procedure.
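The T4 DNA polymerase reaction listed above adds up to 20 µl, so the working concentrations follow from a plain dilution. The sketch below is illustrative only: the dictionary and function names are not from the source, and the stock values are those quoted in the reaction list.

```python
# Working concentrations in the 20 µl T4 DNA polymerase reaction.
# Volumes (µl) and stock concentrations are those listed in the text;
# all names below are illustrative.
VOLUMES_UL = {
    "DNA in water": 16.0,
    "10X T4 polymerase buffer": 2.0,
    "2mM dNTP mixture": 1.0,
    "T4 DNA polymerase": 1.0,
}
TOTAL_UL = sum(VOLUMES_UL.values())

# Components of the 10X buffer: (stock concentration, unit)
BUFFER_10X = {
    "Tris acetate pH7.9": (330.0, "mM"),
    "magnesium acetate": (100.0, "mM"),
    "potassium acetate": (660.0, "mM"),
    "dithiothreitol": (5.0, "mM"),
    "BSA": (1.0, "mg/ml"),
}


def dilute(stock, added_ul, total_ul=TOTAL_UL):
    """Concentration after adding `added_ul` of stock to a `total_ul` reaction."""
    return stock * added_ul / total_ul


if __name__ == "__main__":
    print(f"total volume: {TOTAL_UL:g} µl")
    for name, (stock, unit) in BUFFER_10X.items():
        working = dilute(stock, VOLUMES_UL["10X T4 polymerase buffer"])
        print(f"{name}: {working:g} {unit}")
    print(f"dNTPs: {dilute(2.0, VOLUMES_UL['2mM dNTP mixture']):g} mM")
    print(f"DNA: {1000.0 / TOTAL_UL:g} ng/µl")
```

The 10X buffer is diluted tenfold, as expected, and the dNTPs end up at 0.1 mM.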

The synthetic gene was isolated from the pSTP1 vectors as follows. The vectors were digested with ScaI
and SalI (both from BCL) in 100mM NaCl, 10mM MgCl2 and 10mM Tris HCl (pH7.5). The 530 bp fragment
was purified from a 0.7% agarose gel and isolated by use of Geneclean (trademark) following the
manufacturer's (Bio101) recommended procedure.

For ligation, a mixture of the ScaI-SalI gene fragment (50 ng) and the pICI0020 vector fragment (100 ng)
in 20 µl of a solution containing 50mM Tris HCl (pH7.6), 10mM MgCl2, 1mM ATP, 1mM DTT, 5% w/v PEG
8000 and T4 DNA ligase (2 units; BRL) was incubated at 16°C for 20 hours. The resulting mixture was used
to transform competent E. coli HB101 cells (as supplied by BRL) as described herein. Transformants were
selected by growth on L-agar plates containing 50 µg/ml ampicillin and screened for the presence of the
gene by colony hybridisation with a 32P labelled probe (SEQ ID No 24) as described herein. Plasmid DNA was
prepared from 6 positively hybridising colonies, purified by centrifugation in a caesium chloride gradient and
the sequence confirmed by dideoxy sequencing as described herein.
The plasmid containing this gene was designated pICI1080.
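The ligation above is specified by mass (50 ng insert, 100 ng vector). For double-stranded DNA the molar amount is proportional to mass divided by length, so the insert:vector molar ratio can be sketched as below. The vector fragment length is not given in this section; `ASSUMED_VECTOR_BP` is a hypothetical placeholder, not a value from the source, and should be replaced by the real fragment length.

```python
# Insert:vector molar ratio for a ligation specified by mass.
# For double-stranded DNA, moles are proportional to mass / length (bp).

INSERT_BP = 530            # ScaI-SalI gene fragment, from the text
ASSUMED_VECTOR_BP = 3200   # hypothetical placeholder; not stated in the source


def molar_ratio(insert_ng, insert_bp, vector_ng, vector_bp):
    """Moles of insert per mole of vector."""
    return (insert_ng / insert_bp) / (vector_ng / vector_bp)


def insert_ng_for_ratio(ratio, insert_bp, vector_ng, vector_bp):
    """Mass of insert (ng) giving the requested insert:vector molar ratio."""
    return ratio * vector_ng * insert_bp / vector_bp


if __name__ == "__main__":
    ratio = molar_ratio(50.0, INSERT_BP, 100.0, ASSUMED_VECTOR_BP)
    print(f"insert:vector molar ratio (assumed vector length): {ratio:.2f}")
```

The same helper can be inverted (`insert_ng_for_ratio`) when a ligation is specified by molar excess rather than by mass.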

d) Subcloning of an expression cassette containing a gene for [Ser17,27]G-CSF into M13mp18.

The following subcloning was effected to provide a starting point for preparation of the G-CSF derivatives
detailed in Examples 3-8.

Plasmid DNA from pICI1080 (purified by caesium chloride density centrifugation) was digested to completion
with EcoRI and SalI (BCL) according to the manufacturer's instructions. The small EcoRI-SalI fragment
containing the trp promoter and [Ser17,27]G-CSF gene was isolated from a 0.7% agarose gel by use of
Geneclean (trademark). This fragment was cloned into an EcoRI-SalI cut M13mp18 vector (DNA supplied by
Amersham International; enzymes from BCL). The fragments were ligated together in 5x BRL ligation buffer using
BRL T4 DNA ligase (described previously). The ligation mix was used to transfect competent E. coli TG1 cells
(made competent according to the calcium chloride method of Mandel and Higa described in Molecular Cloning
- A Laboratory Manual - Maniatis et al, Cold Spring Harbor). The transfected cells were suspended in TY top
agar containing 2% X-Gal in DMF and 200 µl log phase E. coli TG1 cells and were plated on 2x TY agar plates
(TY top agar - 8g Bactotryptone, 5g Yeast extract, 5g NaCl, 3.75g Bacto-agar in 500 ml sterile H2O; TY plates
- 8g Bactotryptone, 5g Yeast extract, 5g NaCl, 7.5g Bacto-agar in 500 ml sterile H2O). Four white plaques were
picked into 4 x 2 ml aliquots of 1% E. coli TG1 cells in TY broth (8g Bactotryptone, 5g Yeast extract, 5g NaCl
in 500 ml sterile H2O) and grown for 6 hours at 37°C. The 2 ml cultures were split into 0.5 ml and 1.5 ml aliquots.
The bacteria were centrifuged out of solution in an Eppendorf (trademark) microfuge and the supernatants were
transferred to sterile Eppendorf (trademark) tubes. The 0.5 ml aliquots were stored at -20°C as phage stocks.
The 1.5 ml aliquots were used to prepare single stranded DNA following the method in the Amersham
International M13 sequencing handbook (see below). These DNA samples were then sequenced using
oligonucleotides SEQ ID No 22, SEQ ID No 23 and M13 Universal sequencing primer. The reactions were carried out
using the Sequenase kit (trademark) according to the manufacturer's instructions. All 4 clones had the correct
DNA sequence for [Ser17,27]G-CSF.

Large-scale single stranded DNA preparation

For single stranded DNA preparations of between 200-500 µg of DNA/ml, the method in the Amersham
International "Oligonucleotide Directed Mutagenesis" was used. A detailed procedure is carried out as follows:-

LARGE-SCALE SINGLE STRANDED DNA PREP:

A. Preparation of 1ml phage stock

1. Pick a single TG1 E. coli colony from a glucose/minimal medium plate. Grow overnight in 10 ml 2 x
TY medium, shaken at 37°C. Add 10 µl to 20 ml of fresh medium, and shake at 37°C for 3 hours.

2. Inoculate 1 ml 2 x TY medium in a 10 ml sterile culture tube with 100 µl of 3 hour culture from step 1.

3. Inoculate the 1 ml culture with a recombinant plaque.

4. Incubate for 4 hours with shaking at 37°C. Transfer to a microcentrifuge tube.

5. Centrifuge for 5 minutes at ambient temperature. Pour supernatant into a fresh tube.
Store overnight at 4°C. Set up an overnight culture of TG1 E. coli for the next stage.

B. Growth of 100ml phage culture.

1. Inoculate 100 ml 2 x TY medium with 1 ml of overnight TG1 culture and shake at 37°C to an O.D.500
of 0.3.

2. Add the 1 ml phage supernatant from A5 (above) to the 100 ml culture.

3. Incubate for 5 hours with shaking at 37°C. Transfer to centrifuge tubes.

4. Centrifuge at 5000 x g for 30 minutes at 4°C.

5. Transfer supernatant to a clean centrifuge tube. Take care not to carry over any cells (retain bacterial
pellet for RF DNA preparation).

6. Add 0.2 volumes of 20% w/v PEG 6000 in 2.5M NaCl to the supernatant. Mix well and then leave
to stand for 1 hour at 4°C.

7. Centrifuge at 5000 x g for 20 minutes at 4°C. Discard supernatant.

8. Centrifuge at 5000 x g for 5 minutes, and remove all remaining PEG/NaCl with a drawn out Pasteur
pipette.

9. Resuspend the viral pellet in 500 µl water (double distilled) and transfer to a microcentrifuge tube
(1.5 ml).

10. Centrifuge for 5 minutes in a microcentrifuge to remove any remaining cells. Transfer the
supernatant to a fresh microcentrifuge tube.

11. Add 200 µl 20% PEG/2.5M NaCl to the supernatant, mix well then leave to stand at ambient
temperature for 15 minutes.

12. Centrifuge for 5 minutes, discard supernatant.

13. Centrifuge for 2 minutes. Carefully remove all traces of PEG/NaCl with a drawn out Pasteur pipette.

14. Resuspend the viral pellet in 500 µl double distilled water.

15. Add 200 µl phenol saturated with 10mM Tris HCl pH8.0, 1mM EDTA. Vortex briefly.

16. Stand tube for 15 minutes at room temperature.

17. Centrifuge for 3 minutes.

18. Transfer supernatant to fresh tube.

19. Repeat steps 15-18.

20. Add 500 µl chloroform and extract aqueous phase twice.

21. Add 50 µl 3M sodium acetate and 1 ml absolute ethanol. Mix.

22. Place in a dry ice and ethanol bath for 20 minutes.

23. Centrifuge for 15 minutes.

24. Wash each pellet with 1 ml -20°C ethanol. Pour off.

25. Vacuum dry pellet and redissolve in 50 µl double distilled water. This procedure yields 100-200 µg single
stranded DNA.
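The two PEG/NaCl precipitations in the procedure above (steps B6 and B11) use different volume ratios and so give different final concentrations. The sketch below works them out by simple mixing; `mixed_concentration` is an illustrative helper name, and the calculation assumes volumes are additive.

```python
# Final PEG and NaCl concentrations after adding the 20% PEG / 2.5M NaCl stock.
# Assumption: volumes are additive on mixing.
PEG_STOCK_PERCENT = 20.0
NACL_STOCK_M = 2.5


def mixed_concentration(stock, added_volume, sample_volume):
    """Concentration of a stock component after mixing with a sample."""
    return stock * added_volume / (added_volume + sample_volume)


# Step B6: 0.2 volumes of stock per volume of supernatant.
step6_peg = mixed_concentration(PEG_STOCK_PERCENT, 0.2, 1.0)
step6_nacl = mixed_concentration(NACL_STOCK_M, 0.2, 1.0)

# Step B11: 200 µl of stock added to the 500 µl resuspended pellet.
step11_peg = mixed_concentration(PEG_STOCK_PERCENT, 200.0, 500.0)
step11_nacl = mixed_concentration(NACL_STOCK_M, 200.0, 500.0)

if __name__ == "__main__":
    print(f"step B6:  {step6_peg:.2f}% PEG, {step6_nacl:.2f} M NaCl")
    print(f"step B11: {step11_peg:.2f}% PEG, {step11_nacl:.2f} M NaCl")
```

On these figures the second precipitation is carried out at a somewhat higher PEG and salt concentration than the first.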

e) Fermentation 

pICI1080 was transformed into E. coli strain MSD 522 and the resultant recombinants purified and
maintained on glycerol stocks at -80°C.

An aliquot of the culture was removed from stock and streaked onto agar plates of L-ampicillin to separate
single colonies after overnight growth at 37°C. A single desired colony was removed and resuspended in 10
ml L-ampicillin broth and 100 µl immediately inoculated into each of 10 x 250 ml Erlenmeyer flasks containing
75 ml L-ampicillin broth. After growth for 16h at 37°C on a reciprocating shaker the contents of the flasks were
pooled and used to inoculate a fermenter containing 20L LCM50 growth medium.

Composition of LCM50

KH2PO4                              3.0
Na2HPO4                             6.0
NaCl                                0.5
Casein hydrolysate (Oxoid L41)      2.0
(NH4)2SO4                           10.00
Yeast Extract (Difco)               10.00
Glycerol                            35.00
L-Leucine                           2.5
L-Threonine                         0.9
MgSO4.7H2O                          0.5
CaCl2.2H2O                          0.03
Thiamine                            0.008
FeSO4/Citric Acid                   0.94/0.02
Trace element solution (TES)        0.5ml



Fermentations were then carried out at a temperature of 37°C and at pH 6.7, controlled by automatic addition
of 6M sodium hydroxide solution. The dissolved oxygen tension (dOT) set point was 50% air-saturation
and was initially controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the fermenter,
initially 20 L/min, corresponding to 1 volume per volume per minute (VVM), was increased to 50 L/min (2.5 VVM)
when the fermenter stirrer speed approached 80-90% of its maximum. Since the oxygen transfer rate (OTR)
of the fermenters was unable to meet the oxygen uptake rate (OUR) of the bacteria at a cell density greater
than that corresponding to an OD550 of 50 under the conditions described, dOT in the fermenter at cell densities
greater than this was maintained at 50% air-saturation by restricting bacterial oxygen uptake rate. This was
achieved by formulating the medium to become carbon-limited at an OD550 of 50 and then supplying a feed of
the limiting carbon source, together with ammonium sulphate and yeast extract, at a rate which restricted
bacterial growth rate.
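The aeration figures quoted above can be checked with the definition of VVM (volume of air per volume of medium per minute). The sketch below assumes the working volume stays at the 20 L of LCM50 medium stated for the fermenter; the function name `vvm` is illustrative.

```python
# Aeration rate expressed as volumes of air per volume of medium per minute.
# Assumption: working volume is the 20 L of medium charged to the fermenter.
WORKING_VOLUME_L = 20.0


def vvm(air_flow_l_per_min, working_volume_l=WORKING_VOLUME_L):
    """Air flow normalised to the liquid working volume."""
    return air_flow_l_per_min / working_volume_l


if __name__ == "__main__":
    for flow in (20.0, 50.0):
        print(f"{flow:g} L/min -> {vvm(flow):g} VVM")
```

Both quoted pairs (20 L/min with 1 VVM, 50 L/min with 2.5 VVM) are consistent with a 20 L working volume.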

Fermentations were performed for 16h and during that time samples were taken for measurement of optical
density (OD550), cell dry weight and accumulation of G-CSF within the cells. G-CSF accumulation was
measured by scanning Coomassie blue stained SDS-PAGE gels of whole cell lysates of the sampled bacteria
as is well known in the art.

When OD550 reached 25, casein hydrolysate solution (100 g/l Oxoid L41) was pumped into the fermenters
at a rate of 1.5 g/l/h.
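The casein hydrolysate feed is specified per litre of culture, so the pump setting depends on the culture volume. A minimal sketch, assuming a constant 20 L culture volume (the medium volume stated for the fermenter; the increase in volume caused by feeding is ignored), with illustrative function names:

```python
# Pump rate needed to deliver a feed specified in g per litre of culture per hour.
# Assumption: culture volume is constant at 20 L.
CULTURE_VOLUME_L = 20.0
FEED_STOCK_G_PER_L = 100.0   # casein hydrolysate solution
FEED_RATE_G_PER_L_H = 1.5    # grams of casein hydrolysate per litre culture per hour


def solids_per_hour(rate_g_per_l_h, culture_volume_l):
    """Mass of feed solids delivered per hour (g/h)."""
    return rate_g_per_l_h * culture_volume_l


def pump_rate_ml_per_hour(rate_g_per_l_h, culture_volume_l, stock_g_per_l):
    """Volume of feed solution to pump per hour (ml/h)."""
    return 1000.0 * solids_per_hour(rate_g_per_l_h, culture_volume_l) / stock_g_per_l


if __name__ == "__main__":
    grams = solids_per_hour(FEED_RATE_G_PER_L_H, CULTURE_VOLUME_L)
    ml = pump_rate_ml_per_hour(FEED_RATE_G_PER_L_H, CULTURE_VOLUME_L, FEED_STOCK_G_PER_L)
    print(f"{grams:g} g/h of casein hydrolysate, i.e. {ml:g} ml/h of feed solution")
```

Under the constant-volume assumption this works out to 30 g/h of casein hydrolysate, or 300 ml/h of the 100 g/l solution.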

When OD550 reached approximately 50, the supply of carbon-source in the fermentation batch became
exhausted leading to a rapid rise in dOT from 50% air saturation. At this point, a feed containing glycerol
(470 g/l), yeast extract (118 g/l) and ammonium sulphate (118 g/l) was pumped into the fermenters at a rate
which returned and then maintained the dOT at 50% air saturation with the fermenter stirred at ca 80% of its
maximum. After ca 13-14h this fed-batch feed was replaced with a second feed containing glycerol (715 g/l)
and ammonium sulphate (143 g/l) only. Casein hydrolysate feeding was maintained at 1.5 g/l/h throughout.
After approximately 16 hours, when microscopic examination of the culture showed the presence of large
inclusion bodies within a majority of the cells, bacteria were harvested on a Sorvall RC3B centrifuge (7000g, 30
min., 4°C) and stored frozen at minus 80°C.
f) Purification 

Frozen cell paste (500g) was resuspended at 4°C in 50mM Tris HCl, 25mM EDTA, pH8.0 (5 litres) using
a Silverson model AXR homogeniser. The suspension was lysed by passing three times through a
Manton-Gaulin homogeniser at 6000 psi and centrifuged at 5000xg for 30 minutes in a Sorvall RC3C centrifuge using
a H6000A rotor. The supernatant was discarded and the pellet fraction stored at -20°C before further
purification.

The pellet fraction (60-100g) was thawed and resuspended in 1% w/v deoxycholic acid (sodium salt) in 5mM
EDTA, 5mM dithiothreitol, 50mM Tris HCl, pH9.0 (1200ml) containing 1mg/ml of sodium azide using a
Polytron homogeniser with a PTA 20 probe at speed setting 5. The suspension was mixed for 30 minutes at room
temperature and centrifuged at 6500xg for 30 minutes in a Sorvall RC5C centrifuge using a GSA rotor. The
supernatant was discarded and the pellet was retreated twice in the same manner. The pellet was next twice
resuspended in water (1 litre) and centrifuged at 15,000xg for 20 minutes. The final pellet containing washed
inclusion bodies was solubilised in 2% w/v N-lauroyl sarcosine sodium salt (Sarkosyl) in 50mM Tris HCl, pH
8.0 (150ml) containing 1mg/ml sodium azide. Cupric sulphate was added to 20 µM and the mixture stirred for
16 hours at 20°C before centrifugation at 30,000xg for 30 minutes in a Sorvall RC5C centrifuge using a SS34
rotor. The supernatant containing the derivative was stored at -20°C in 50 ml aliquots before further purification.

Solubilised derivative (20ml) was thawed and passed through a 5 µm filter to remove any particulate
material. The filtrate was applied to a column (5 x 90 cm) of Ultrogel AcA54 equilibrated with 0.3% w/v N-lauroyl
sarcosine (sodium salt) in 50mM Tris HCl, pH 8.0 containing 1mg/ml sodium azide at 4°C. The column was
eluted with the same buffer at a flow rate of 2.5 ml/minute and fractions of 10 ml were collected. Fractions
containing the derivative protein were pooled (approximately 100ml) and stored at 4°C.
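The gel filtration step can be put into more familiar terms (bed volume, linear flow rate, time per fraction) from the column dimensions and flow rate. The sketch below assumes that "5 x 90 cm" means internal diameter x bed height, which is the usual convention but is not stated explicitly in the text; function names are illustrative.

```python
import math

# Geometry and flow figures for the Ultrogel AcA54 column.
# Assumption: "5 x 90 cm" is internal diameter x bed height.
DIAMETER_CM = 5.0
BED_HEIGHT_CM = 90.0
FLOW_ML_PER_MIN = 2.5
FRACTION_ML = 10.0
SAMPLE_ML = 20.0


def cross_section_cm2(diameter_cm):
    """Cross-sectional area of a cylindrical column."""
    return math.pi * (diameter_cm / 2.0) ** 2


def bed_volume_ml(diameter_cm, height_cm):
    """Bed volume in ml (1 cm3 = 1 ml)."""
    return cross_section_cm2(diameter_cm) * height_cm


def linear_flow_cm_per_h(flow_ml_per_min, diameter_cm):
    """Volumetric flow converted to linear flow rate."""
    return flow_ml_per_min * 60.0 / cross_section_cm2(diameter_cm)


if __name__ == "__main__":
    volume = bed_volume_ml(DIAMETER_CM, BED_HEIGHT_CM)
    print(f"bed volume: {volume:.0f} ml")
    print(f"linear flow: {linear_flow_cm_per_h(FLOW_ML_PER_MIN, DIAMETER_CM):.1f} cm/h")
    print(f"minutes per fraction: {FRACTION_ML / FLOW_ML_PER_MIN:g}")
    print(f"sample load: {100.0 * SAMPLE_ML / volume:.1f}% of bed volume")
```

With these assumptions the 20 ml load is a little over 1% of the bed volume, and each 10 ml fraction takes 4 minutes to collect.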

Pooled derivative-containing fractions from several columns were combined (300-500ml) and dialysed
against 10mM sodium phosphate, 150mM sodium chloride pH 7.4 (3-5 litres) containing 1mg/ml sodium azide
using an Amicon CH2A-1S spiral cartridge diafiltration apparatus equipped with a S1Y10 membrane (10kD
cut-off). The retentate was centrifuged at 30,000xg for 30 minutes in a Sorvall RC5C centrifuge using an SS34
rotor, and the supernatant dialysed in Spectrapor 6-8kD cut-off dialysis tubing for 40 hours against three
changes (8 litres/300ml of supernatant) of 20mM sodium acetate, 100mM sodium chloride, pH 5.4 containing
1mg/ml sodium azide. The precipitate which formed was removed by centrifugation at 30,000xg for 30
minutes and the supernatant dialysed for 24 hours against water containing 1mg/ml sodium azide followed by
72 hours against six changes of water. The final retentate was clarified by centrifugation at 30,000xg for 30
minutes and stored frozen at -20°C (protein concentration about 1mg/ml) or at 4°C after freeze drying.

The concentration of N-lauroyl sarcosine (sodium salt) had fallen to below 0.001% w/v after diafiltration
and was below the limit of detection (about 0.0001%) of the rpHPLC method used after dialysis against water.

Example 2

Preparation of [Ser17,27] human G-CSF

The procedure described in Example 1 was repeated except as follows:-

The duplex I was phosphorylated with T4 polynucleotide kinase and digested with MstII (10 units) in 1 X
H buffer (BCL; 30 µl) for 2 hours at 37°C.

Following precipitation with ethanol, the 143 bp EcoRI-MstII fragment was purified on a 10%
polyacrylamide gel containing 7M urea, isolated by electroelution from a gel slice and the DNA strands annealed as
described in Reference Example 1.

The synthetic EcoRI-MstII fragment described above was cloned into the plasmid vector pAG88 described
in Reference Example 1. For vector preparation, pAG88 (10 µg) was digested with MstII (20 units; BCL) in 1
X H buffer (BCL; 100 µl) for 2 hours at 37°C. The DNA was precipitated with ethanol from 0.3 M sodium acetate
at -20°C then digested with EcoRI (20 units; BCL) in 1 X H buffer (BCL; 100 µl) for 2 hours at 37°C. Following
precipitation with ethanol, the large EcoRI-MstII fragment was purified on a 1% agarose gel and isolated using
Geneclean (trademark) as described by the manufacturer (Bio 101, USA). Colonies containing the synthetic
fragment were confirmed by screening with a radioactive probe prepared from oligonucleotide (SEQ ID No 24)
and the correct sequence confirmed by DNA sequencing as described in Reference Example 1. The plasmid
containing the gene for [Ser17,27]G-CSF was designated pICI1107. The gene was cloned into expression vector
pICI0020 and fermentation and protein purification were effected as described in Example 1.

Example 3 

Preparation of [Arg11, Ser17,27,60,65] human G-CSF

The procedure described in Reference Example 2 was repeated using the mutagenic template M13mp18
containing the gene for [Ser17,27]G-CSF described in Example 1 or 2. The mutagenic oligonucleotides used are
designated SEQ ID No 28 and SEQ ID No 29 (as hereinafter defined).

The triplet ACG in SEQ ID No 28 serves to convert Gln at position 11 to Arg and the first and last AGA
triplets in SEQ ID No 29 serve to convert Pro at positions 65 and 60 to Ser. The mutagenesis was carried out
as described in Reference Example 2 using SEQ ID No 29 in a single priming mutagenesis. This yielded a single
plaque which incorporated the Pro 60 Ser and Pro 65 Ser changes. Single stranded DNA was prepared from
this plaque as described in Reference Example 2. This DNA was used as a mutagenic template in a single
priming mutagenesis using SEQ ID No 28 as mutagenic primer. This yielded >100 plaques, 3 of which were
screened by DNA sequencing as previously described. All 3 had the full set of changes incorporated.
Double-stranded RF DNA was prepared from one of the plaques by following the procedure for large scale preparation
of single stranded DNA (step d in Example 1) to step B5. The RF DNA was extracted from the bacterial pellet
by the alkali lysis procedure of Birnboim and Doly (Nucleic Acids Research (1979) 7, 1513-1523) and purified
by caesium chloride density gradient centrifugation as described in "Molecular Cloning - a Laboratory Manual"
by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Publication). The purified RF DNA was digested with
EcoRI and SalI in buffer H as described previously and the small fragment, containing the trp promoter,
ribosome binding site, translation initiation codon and gene for [Arg11, Ser17,27,60,65]G-CSF, isolated from a 0.7%
agarose gel by use of Geneclean (TM). The fragment was ligated into an EcoRI-SalI digested pICI0020 vector,
using a 2:1 molar excess of insert to vector, with T4 DNA ligase (BRL) and ligase buffer, essentially as described
previously. The ligation mix was used to transform E. coli strain HB101. Transformants were selected by
growth on L-agar plates containing 50 µg/ml ampicillin. Colonies were screened for the presence of the inserted
DNA by restriction analysis of plasmid DNA prepared by the method of Birnboim and Doly as described in
"Molecular Cloning - a Laboratory Manual" Sambrook, Fritsch and Maniatis (Cold Spring Harbor Publication).
Plasmid DNA from a colony containing the expected 619 bp EcoRI-SalI insert was used to transform E. coli
strain MSD522 and designated pICI1239. Fermentation and purification were effected as described in Example
1.
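The triplets quoted for the mutagenic oligonucleotides (ACG for Arg, AGA for Ser, and in later Examples TTT for Lys, TTC for Glu, AGC for Ala, GCG for Arg) are not themselves codons for those residues. They do make sense if the oligonucleotides are read as complementary to the coding strand, which is an inference from the quoted triplets rather than something stated in this section. The sketch below checks that reading with the standard genetic code; all names are illustrative.

```python
from itertools import product

# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def residue_from_primer_triplet(triplet):
    """Residue encoded on the coding strand opposite a primer triplet.

    Assumption: the primer is complementary to the coding strand.
    """
    return CODON_TABLE[reverse_complement(triplet)]


# Primer triplet -> residue (one-letter code) stated in the Examples.
STATED = {"ACG": "R", "AGA": "S", "TTT": "K", "TTC": "E", "AGC": "A", "GCG": "R"}

if __name__ == "__main__":
    for triplet, stated in STATED.items():
        codon = reverse_complement(triplet)
        found = residue_from_primer_triplet(triplet)
        print(f"{triplet} -> codon {codon} -> {found} (stated: {stated})")
```

Every stated substitution matches the reverse complement of the quoted triplet, which supports the antisense reading.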

Example 4 

Preparation of [Ser17,27,115,116, Glu111] human G-CSF

The procedure described in Example 3 was repeated using the mutagenic template M13mp18 containing
the gene for [Ser17,27]G-CSF described in Example 1 or 2. The mutagenic oligonucleotide used is designated
SEQ ID No 30 (as hereinafter defined).

The triplet GCT serves to convert Thr at position 116 to Ser, the triplet AGA serves to convert Thr at position
115 to Ser and the triplet TTC serves to convert Ala at position 111 to Glu. The mutagenesis procedure was
essentially as described for Example 3 and the expression cassette was transferred to the expression plasmid
to give pICI1243. Fermentation and purification were effected as described in Example 1.

Example 5

Preparation of [Arg11, Ser17,27, Lys58, Arg165] human G-CSF

The procedure described in Example 3 was repeated using the mutagenic template M13mp18 containing
the gene for [Ser17,27]G-CSF described in Example 1 or 2. The mutagenic oligonucleotides used are designated
SEQ ID No 28, SEQ ID No 31 and SEQ ID No 32 (as hereinafter defined).

The triplet TTT in SEQ ID No 31 serves to convert Trp at position 58 to Lys and in SEQ ID No 32 the second
GCG triplet serves to convert Tyr at position 165 to Arg.

The mutagenesis procedure was initially carried out as a double priming experiment using SEQ ID No 31
and SEQ ID No 32 as mutagenic oligonucleotides as described for Reference Example 2. This yielded 2 plaques
both of which had the SEQ ID No 32 change (Tyr 165 Arg) but not the SEQ ID No 31 change. Single stranded
DNA was prepared from one of these plaques as described in Example 1. This DNA was used as a mutagenic
template in a double priming mutagenesis using SEQ ID No 28 and SEQ ID No 31 as mutagenic primers. This
yielded 2 plaques, one of which had the complete set of changes incorporated, and the expression cassette was
transferred to the expression plasmid to give pICI1246. Fermentation and purification were effected as
described in Example 1.

Example 6 

Preparation of [Glu15, Ser17,27, Ala26,28, Lys30] human G-CSF

a) The procedure described in Example 3 was repeated using the mutagenic template M13mp18 containing
the gene for [Ser17,27]G-CSF described in Example 1 or 2. The mutagenic oligonucleotides used are designated
SEQ ID No 33 and SEQ ID No 34 (as hereinafter defined).

The triplet TTC in SEQ ID No 33 serves to convert Leu at position 15 to Glu. In SEQ ID No 34 the first TTT
triplet serves to convert Ala at position 30 to Lys and the triplets AGC serve to convert Gly at position 28 and
26 to Ala.

The mutagenesis procedure was essentially as described in Reference Example 2 as a double priming
experiment and the expression cassette transferred to the expression plasmid to give pICI1266. Fermentation
was effected as described in Example 1.

b) Purification

Frozen cell paste was lysed and the crude pellet fraction separated as in Example 1. The inclusion bodies
in the pellet containing this protein were solubilised by the deoxycholic acid (sodium salt) buffer described in
Example 1. The following modified procedure was used for this protein.

Crude pellet fraction (60-100g) was thawed and resuspended in 25mM EDTA, 50mM Tris HCl, pH 8.0
(1200ml) using a Polytron homogeniser with a PTA 20 probe at speed setting 5. The suspension was mixed
at room temperature for 30 minutes and centrifuged at 6,500 x g for 30 minutes in a Sorvall RC5C centrifuge
using a GSA rotor. The supernatant was discarded and the pellet retreated twice in the same manner. The pellet
was next twice resuspended in water (1 litre) and centrifuged as in Example 1. Thereafter the purification
procedure was as in Example 1.

Example 7

Preparation of [Ser17,27, Lys49,58, Ala44,51,55] human G-CSF

The procedure described in Example 3 was repeated using the mutagenic template M13mp18 containing
the gene for [Ser17,27]G-CSF described in Example 1 or 2. The mutagenic oligonucleotides used are designated
SEQ ID No 35 and SEQ ID No 36 (as hereinafter defined). In SEQ ID No 35 the triplets AGC serve to convert
Gly to Ala at position 51 and Pro to Ala at position 44 and the triplet TTT serves to convert Leu to Lys at position
49. In SEQ ID No 36 the triplet TTT serves to convert Trp to Lys at position 58 and the second AGC triplet
serves to convert Gly to Ala at position 55.
The mutagenesis was carried out as a double priming experiment as described in Reference Example 2. This
yielded 16 plaques. 8 plaques were screened by DNA sequencing as described in Example 3. All plaques had
the SEQ ID No 36 changes (Gly55Ala, Trp58Lys) but none had the SEQ ID No 35 changes. Single stranded
DNA was prepared from one of these plaques as described in Example 1 (d) and used as a mutagenic template
in a single priming mutagenesis using SEQ ID No 35 as mutagenic primer. This yielded 50 plaques, 3 of which
were screened by DNA sequencing; 2 had the complete set of changes. The expression cassette was
transferred to the expression plasmid to give pICI1297. Fermentation and purification were effected as described in
Example 1.

Example 8 

Preparation of [Arg11, Glu15, Ser17,27,60,65, Ala26,28, Lys30] human G-CSF

The procedure described in Example 3 was repeated using the mutagenic template M13mp18 containing
the gene for [Glu15, Ser17,27, Ala26,28, Lys30] human G-CSF described in Example 6. The mutagenic
oligonucleotide used is designated SEQ ID No 28 which serves to convert Gln at position 11 to Arg. The modified gene
was isolated and ligated into pICI0020 vector (Example 1). This vector was used to transform E. coli strain
MSD522 as described in Example 3 and designated pICI1347. pICI1347 plasmid DNA was isolated from
MSD522, purified by caesium chloride density centrifugation and digested to completion with BamHI and SalI
(BCL). Plasmid DNA (5 µg) was incubated at 37°C for 2 hours in BCL high salt buffer (100 µl) (50 mM Tris HCl
pH 7.5, 10 mM MgCl2, 100 mM NaCl, 1mM dithioerythritol) containing BamHI (40 units) and SalI (50 units).
The DNA was precipitated by addition of 3M sodium acetate (10 µl) and absolute ethanol (250 µl) and cooling
to -20°C for 2 hours, collected by centrifugation (10 min at 10,000 rpm), dried in vacuo and dissolved in water
(10 µl). Sample loading buffer (2 µl containing 240 mM Tris acetate pH 7.8, 6 mM EDTA, 20% sucrose, 0.2%
xylene cyanol and 0.2% bromophenol blue) was added and the mixture loaded onto a 0.7% agarose
preparative gel (in 40 mM Tris acetate (pH 7.8) and 1 mM EDTA) containing ethidium bromide (0.5 µg/ml) and
electrophoresed at 100 volts for 1 hour. The large BamHI-SalI vector fragment was isolated from a 0.7%
agarose gel by use of Geneclean (trademark). In a similar manner, pICI1239 plasmid DNA from Example 3
was isolated and digested with BamHI and SalI. The small BamHI-SalI fragment, containing the Ser codons
at position 60 and 65, was isolated and ligated to the large BamHI-SalI vector fragment described above. The
mixture was used to transform E. coli strain MSD522 and the plasmid designated pICI1348. Fermentation and
purification were effected as described in Example 6.

Example 9 

The procedure of Examples 1 and 2 was repeated using E. coli strain TG1 instead of E. coli strain MSD 522
in the fermentation step (see for example Example 1(e)).

Example 10 

Alternative Extraction Process for Human [Met-1, Arg11, Ser17,27,60,65]G-CSF

Frozen cell paste (640 g) was resuspended at 4°C in 50mM Tris HCl, 5mM EDTA, 5mM dithiothreitol, 2M
urea, pH 8.0 containing 1 mg/ml sodium azide (5 litres) using a Polytron homogeniser with a PTA20 probe at
speed setting 7/8. The suspension was lysed by passing three times through a Manton-Gaulin Lab 60/60
homogeniser at 6000 psi and flushed through with a further 1 litre of buffer. Cooling was provided by a single
pass Conair chiller at -20°C. The lysate was centrifuged at 5000 x g for 30 minutes in a Sorvall RC3C
centrifuge using an H6000A rotor.

The supernatant was discarded and the pellet (about 450 g) was resuspended in the same buffer (10
litres). The suspension was mixed for 30 minutes at room temperature and centrifuged at 5000 rpm for 30
minutes in two Sorvall RC3C centrifuges using H6000A rotors. The supernatant was discarded and the pellet
retreated twice in the same manner. The pellet was next twice resuspended in water (10 litres) and centrifuged
at 5000 rpm for 30 minutes. The final pellets containing washed inclusion bodies were resuspended in 2%
w/v N-lauroyl sarcosine sodium salt in 50mM Tris HCl, pH 8.0 (1 litre) containing 1 mg/ml sodium azide using
a Polytron homogeniser at speed setting 7. 20 mM cupric sulphate in water (1.5 ml) was added and the mixture
stirred overnight at room temperature before centrifugation at 10,000 rpm for 30 minutes in a Sorvall RC5C
centrifuge using a GSA rotor.

The supernatant containing the derivative was filtered through a 5 µm filter to remove any particulate matter,
diluted six-fold with 50 mM Tris HCl, pH 8.0 containing 1 mg/ml sodium azide at 4°C, and diafiltered at
maximum pressure in an Amicon DC20 ultrafiltration device fitted with a S10Y10 cartridge (10 kD cut-off)
against 10 mM sodium phosphate, 150 mM sodium chloride pH 7.4 (90 litres) containing 1 mg/ml sodium
azide. A precipitate formed towards the end of the diafiltration.

The retentate (2.1 mg/ml total protein, 1.7 mg/ml product) was collected in 4 litre, screw top, polypropylene
containers and incubated overnight at 37°C. The precipitate which formed was removed by centrifugation at
5000 rpm for 45 minutes in a Sorvall RC3C, and the supernatant stored at 4°C.

Monitoring by SDS-PAGE and rpHPLC showed that during the final heat treatment contaminating E. coli
proteins, product oligomers, and degradation products were selectively precipitated, with some 85% of the
desired product remaining in solution. The highly enriched, clarified, heat treated product solution was fully
biologically active and stable at 20 mg/ml at 37°C over two weeks with no evidence of proteolytic degradation and
less than 20% precipitation. This provided an excellent intermediate for further chromatographic purification.

Example 11 

Characterisation of G-CSF and derivatives thereof

Water solutions of [Met⁻¹,Ser¹⁷]G-CSF and derivatives thereof (Examples 1-9) (protein concentration about 1 mg/ml) were concentrated to at least 11 mg/ml of protein on an Amicon YM10 membrane at 4°C. To prevent any precipitation during concentration, the starting solution (pH 5.5) was first adjusted to pH 8.5 by the addition of ammonium hydroxide to a final concentration of about 0.25mM. After concentration the pH of the solution had fallen to about 8.0.

The concentrated protein solution was adjusted to 10 mg/ml protein (estimated from a 1 mg/ml solution giving an A280 of 1.0) by addition of 20 fold concentrated phosphate buffered saline. This 10 mg/ml solution of derivative in 10mM sodium phosphate, 150mM sodium chloride, pH 7.4 (PBS) provided a common stock solution from which to establish homogeneity, identity, biological activity and solution stability of the protein.

A stock solution of human G-CSF at 1 mg/ml concentration in PBS, prepared as described in Reference Example 1, was also prepared.
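The arithmetic behind bringing a concentrate to 10 mg/ml in single-strength PBS with a 20-fold PBS concentrate can be sketched as follows. This is only an illustration: the function names, the example volume, and the assumption that any remaining volume is made up with water are not taken from the text.

```python
def pbs_adjustment(conc_mg_ml, volume_ml, target_mg_ml=10.0, pbs_fold=20.0):
    """Volumes needed to bring a protein concentrate to the target in 1x PBS.

    Returns (pbs_concentrate_ml, water_ml, final_volume_ml).
    """
    protein_mg = conc_mg_ml * volume_ml
    final_volume_ml = protein_mg / target_mg_ml
    # A 20-fold concentrate has to make up 1/20 of the final volume.
    pbs_ml = final_volume_ml / pbs_fold
    # Assumption (not stated in the text): the remainder is water.
    water_ml = final_volume_ml - volume_ml - pbs_ml
    if water_ml < -1e-9:
        raise ValueError("concentrate is too dilute to reach the target in 1x PBS")
    return pbs_ml, max(water_ml, 0.0), final_volume_ml


def minimum_concentrate(target_mg_ml=10.0, pbs_fold=20.0):
    """Lowest starting concentration that still leaves room for the PBS concentrate."""
    return target_mg_ml / (1.0 - 1.0 / pbs_fold)
```

Under these assumptions `minimum_concentrate()` comes to roughly 10.5 mg/ml, which is consistent with concentrating to at least 11 mg/ml before adding the buffer. For example, `pbs_adjustment(11.0, 10.0)` describes 10 ml of an 11 mg/ml concentrate.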

Each protein was shown to be at least 95% one component by PAGE-SDS run under reducing and non-reducing conditions and by reverse phase HPLC. Repeated amino acid composition analysis after acid hydrolysis in 6N HCl at 110°C provided amino acid ratios for each derivative, and an accurate measurement of the protein concentration in the stock solution. This protein concentration together with the mean of bioassay titres obtained on at least six different days was used to determine the specific activity of the derivative. N-terminal sequence analysis and electrospray mass spectrometric analysis of selected derivatives gave the expected sequences and molecular weights.

Example 12 

Preparation of [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]human G-CSF using production vector including trp promoter

a) Plasmid pICI 1239 (described in Example 3) was digested with EcoRI and SalI in buffer H as described previously. The small EcoRI-SalI fragment containing the trp promoter, ribosome binding site and gene for [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF was isolated from a 0.7% agarose gel by use of Geneclean(TM). A vector fragment was prepared from pICI 0080 (see Reference Example 6) by digestion with EcoRI and XhoI in buffer H and the large EcoRI-XhoI fragment isolated from a 0.7% agarose gel by use of Geneclean(TM). The small EcoRI-SalI fragment was ligated into the EcoRI-XhoI vector fragment, using a 2:1 molar excess of insert to vector as described previously, and the ligation mix used to transform E. coli strain MSD 522. Transformants were selected for growth on L-agar plates containing tetracycline (15µg/ml). Three colonies were selected and grown up in M9 minimal media (75ml) containing supplements and tetracycline (15µg/ml) at 37°C for 20 hours on a reciprocating shaker. Protein accumulation was measured by scanning Coomassie blue stained SDS-PAGE gels of whole cell lysate. All three clones expressed [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF. Plasmid DNA from one of the colonies was designated pICI 1327 and the sequence of the promoter and gene confirmed by standard dideoxy sequencing procedures as described previously.
b) Fermentation 

pICI 1327 was transformed into E. coli strain MSD 522 and the resultant recombinants purified and maintained on glycerol stocks at -80°C.

An aliquot of the culture was removed from stock and streaked onto agar plates of tetracycline to separate single colonies after overnight growth at 37°C. A single desired colony was removed and resuspended in 10 ml tetracycline broth and 100µl immediately inoculated into each of three 250 ml Erlenmeyer flasks containing 75 ml tetracycline broth. After growth for 16h at 37°C on a reciprocating shaker the contents of the flasks were pooled and used to inoculate a fermenter containing 20L growth medium.

Composition of Growth Medium

Made up of distilled water                g/l

KH₂PO₄                                    3.0
Na₂HPO₄                                   6.0
NaCl                                      0.5
Casein hydrolysate (Oxoid L41)            2.0
(NH₄)₂SO₄                                 10.00
Yeast Extract (Difco)                     10.00
Glycerol                                  35.00
L-Leucine                                 0.625
MgSO₄.7H₂O                                0.5
CaCl₂.2H₂O                                0.03
Thiamine                                  0.008
FeSO₄/Citric Acid                         0.04/0.02
Trace element solution (TES)              0.5 ml l⁻¹
Tetracycline                              10 mg l⁻¹
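As a hedged illustration, the per-litre recipe can be scaled to the 20 L batch volume mentioned for the fermenter. The dictionary layout and function name are hypothetical, the unit of g/l for the unlabelled entries is assumed by analogy with the medium table of Example 13, and the FeSO₄/citric acid entry is split into two components for convenience.

```python
# Per-litre amounts transcribed from the growth medium table.
# Assumption: entries without an explicit unit are in g/l.
MEDIUM_PER_LITRE = {
    "KH2PO4": (3.0, "g"),
    "Na2HPO4": (6.0, "g"),
    "NaCl": (0.5, "g"),
    "Casein hydrolysate (Oxoid L41)": (2.0, "g"),
    "(NH4)2SO4": (10.0, "g"),
    "Yeast extract (Difco)": (10.0, "g"),
    "Glycerol": (35.0, "g"),
    "L-Leucine": (0.625, "g"),
    "MgSO4.7H2O": (0.5, "g"),
    "CaCl2.2H2O": (0.03, "g"),
    "Thiamine": (0.008, "g"),
    "FeSO4": (0.04, "g"),
    "Citric acid": (0.02, "g"),
    "Trace element solution (TES)": (0.5, "ml"),
    "Tetracycline": (10.0, "mg"),
}


def batch_amounts(per_litre, volume_l):
    """Scale each per-litre amount to the requested batch volume."""
    return {name: (amount * volume_l, unit) for name, (amount, unit) in per_litre.items()}


if __name__ == "__main__":
    for name, (amount, unit) in batch_amounts(MEDIUM_PER_LITRE, 20.0).items():
        print(f"{name}: {amount:g} {unit}")
```

For a 20 L batch this gives, for instance, 700 g glycerol and 200 mg tetracycline.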



Fermentations were then carried out at a temperature of 37°C, and at a pH, controlled by automatic addition of 6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air-saturation and was initially controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the fermenter, initially 20 L/min, corresponding to 1 volume per volume per minute (VVM), was increased to 50 L/min (2.5 VVM) when the fermenter stirrer speed approached 80-90% of its maximum. Since the oxygen transfer rate (OTR) of the fermenters was unable to meet the oxygen uptake rate (OUR) of the bacteria at a cell density greater than that corresponding to an OD550 of 50 under the conditions described, dOT in the fermenter at cell densities greater than this was maintained at 50% air-saturation by restricting bacterial oxygen uptake rate. This was achieved by formulating the medium to become carbon-limited at OD550 of 50 and then supplying a feed of the limiting carbon source, together with ammonium sulphate and yeast extract, at a rate which restricted bacterial growth rate.

Fermentations were performed for 18h and during that time samples were taken for measurement of optical density (OD550), cell dry weight and accumulation of [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]human G-CSF within the cells. [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]human G-CSF accumulation was measured by scanning Coomassie blue stained SDS-PAGE gels of whole cell lysates of the sampled bacteria as is well known in the art. When OD550 reached 35 (8.5h), casein hydrolysate solution (100g/l Oxoid L41) was pumped into the fermenters at a rate of 0.75g/l/h.

When OD550 reached approximately 50, the supply of carbon-source in the fermentation batch became exhausted leading to a rapid rise in dOT from 50% air saturation. At this point, a feed containing glycerol (470g/l), yeast extract (118g/l) and ammonium sulphate (118g/l) was pumped into the fermenters at a rate which returned and then maintained the dOT at 50% air saturation with the fermenter stirrer at ca 70-80% of its maximum. Casein hydrolysate feeding was maintained at 0.75g/l/h throughout. After approximately 18 hours, when microscopic examination of the culture showed the presence of large inclusion bodies within a majority of the cells, bacteria were harvested on a Sorvall RC3B centrifuge (7000g, 30 min., 4°C) and stored frozen at minus 80°C.
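Two of the figures quoted for this fermentation lend themselves to a quick arithmetic check: the aeration rate in VVM and the pump rate implied by the casein hydrolysate feed. The sketch below uses hypothetical function names and assumes that the 0.75 g/l/h dose is expressed per litre of the initial 20 L batch; the text does not state the volume basis.

```python
def vvm(air_flow_l_min, culture_volume_l):
    """Aeration rate in volumes of air per volume of culture per minute."""
    return air_flow_l_min / culture_volume_l


def feed_pump_rate_ml_h(dose_g_per_l_h, culture_volume_l, stock_g_per_l):
    """Pump rate (ml/h) of a stock solution needed to deliver a given dose.

    Assumption: the dose is per litre of culture at the stated batch volume.
    """
    grams_per_hour = dose_g_per_l_h * culture_volume_l
    return grams_per_hour / stock_g_per_l * 1000.0


if __name__ == "__main__":
    print(vvm(20.0, 20.0), vvm(50.0, 20.0))
    print(feed_pump_rate_ml_h(0.75, 20.0, 100.0))
```

With a 20 L culture, 20 L/min and 50 L/min correspond to 1 and 2.5 VVM, matching the values in the text, and the 100 g/l casein hydrolysate stock would be pumped at about 150 ml/h under the stated assumption.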

c) Purification 

Purification was effected as described in Example 1(f).

Example 13

Preparation of [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]human G-CSF using production vector including T7A3 promoter

a) An EcoRI-SalI fragment, containing a T7A3 promoter, a trp leader ribosome binding site sequence and a gene for [Ser¹⁷,²⁷]hu G-CSF was sub-cloned into M13mp18 as described in part d) of Example 1. The sequence of the EcoRI-SalI fragment is set out in SEQ ID No 50 and Figure 3. SEQ ID No 50 consists of the EcoRI restriction site (nucleotides 1-6), the A3 promoter sequence of bacteriophage T7 (nucleotides 7-52), the trp leader ribosome binding site sequence (nucleotides 53-78) and translation initiation codon (nucleotides 79-81). Figure 3 sets out the nucleotide sequence of [Ser¹⁷,²⁷]human G-CSF terminating in the SalI restriction site. It will be appreciated that the 3'-terminal ATG codon of SEQ ID No 50 immediately precedes the ACT codon which codes for threonine (amino acid 1) in Figure 3. The 5' nucleotide sequence AATTCAGT is thus absent from the EcoRI-SalI fragment. The EcoRI-SalI fragment may also be prepared by excision from pICI 1295 (see Reference Example 7). Site-directed mutagenesis was performed on single-stranded DNA as described in Reference Example 2 using oligonucleotide SEQ ID No 28 to convert the codon for Gln at position 11 to Arg. Double-stranded RF DNA was prepared from a plaque containing the Gln¹¹→Arg¹¹ change as described in Example 3, except that at step B3 incubation was for 3 hours instead of 5 hours, and digested with EcoRI (as described previously) and SnaBI (as described in Reference Example 5). The resulting 144 bp EcoRI-SnaBI fragment containing the T7A3 promoter, trp leader ribosome binding site sequence and gene fragment with Arg¹¹ codon was isolated and ligated to an EcoRI-SnaBI cut vector from pICI 1327 (which contains codons for Ser⁶⁰ and Ser⁶⁵ and is described in Example 12). The ligation mix was used to transform E. coli strain MSD522 and transformants selected for growth on L-agar plates containing tetracycline (15µg/ml). Plasmid DNA from a colony containing the expected T7A3 promoter and [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF gene sequence was identified by sequencing DNA from the isolated plasmid and designated pICI 1386.
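The layout of SEQ ID No 50 given in part (a) can be tabulated and checked for internal consistency. The sketch below is illustrative only: the feature table is transcribed from the coordinates quoted above, while the helper names and the placeholder sequence used in the usage note are hypothetical and do not reproduce the actual SEQ ID No 50.

```python
# (name, first nucleotide, last nucleotide), 1-based inclusive, as quoted in the text.
SEQ_ID_50_FEATURES = [
    ("EcoRI restriction site", 1, 6),
    ("T7 A3 promoter", 7, 52),
    ("trp leader ribosome binding site", 53, 78),
    ("translation initiation codon", 79, 81),
]


def feature_lengths(features):
    """Length in nucleotides of each feature."""
    return {name: end - start + 1 for name, start, end in features}


def is_contiguous(features):
    """True if every feature starts immediately after the previous one ends."""
    return all(
        nxt[1] == prev[2] + 1 for prev, nxt in zip(features, features[1:])
    )


def slice_feature(sequence, start, end):
    """Extract a feature from a sequence using 1-based inclusive coordinates."""
    return sequence[start - 1:end]
```

The four features are contiguous and span 81 nucleotides in total (6, 46, 26 and 3). With a placeholder such as `"GAATTC" + "N" * 75`, `slice_feature(placeholder, 1, 6)` returns the first six characters.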

The fermentation was effected according to two alternative processes (b) and (c) below. Process (b) was effected at 37°C and after 16 hours fermentation as described, microbial biomass was 35 g/l and [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]human G-CSF was estimated to be accumulated to 7 g/l fermentation broth. Process (c) was effected at 30°C and the fermentation was accordingly slower because of the lower fermentation temperature. With regard to process (c), after 35 hours, the microbial biomass was 55 g/l and the [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]human G-CSF yield was estimated to be accumulated to 15 g/l fermentation broth.
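The two processes can be compared with a short calculation. The sketch below uses only the figures quoted above; the dictionary keys and function names are hypothetical, and treating the product-to-biomass ratio as a mass fraction assumes that both values are expressed on the same basis, which the text does not confirm.

```python
# Figures quoted for processes (b) and (c).
RUNS = {
    "b": {"temp_c": 37, "hours": 16, "biomass_g_l": 35.0, "product_g_l": 7.0},
    "c": {"temp_c": 30, "hours": 35, "biomass_g_l": 55.0, "product_g_l": 15.0},
}


def product_fraction(run):
    """Product titre as a fraction of microbial biomass."""
    return run["product_g_l"] / run["biomass_g_l"]


def volumetric_productivity(run):
    """Average product accumulation in g per litre per hour."""
    return run["product_g_l"] / run["hours"]


if __name__ == "__main__":
    for label, run in RUNS.items():
        print(label, round(product_fraction(run), 3), round(volumetric_productivity(run), 3))
```

On these numbers, process (c) gives the higher titre and the higher product-to-biomass ratio (about 0.27 against 0.20), while the average hourly accumulation is similar for both (about 0.43 and 0.44 g/l/h).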

(b) E. coli strain CGSC 6300 (genotype F⁻, λ⁻, lac+) obtained from the E. coli Genetic Stock Centre was transformed with plasmid pICI 1386. The resultant strain CGSC 6300 (pICI 1386) was purified and maintained in glycerol stocks at -80°C. An aliquot of the culture was removed from stock and streaked onto agar plates of L-tetracycline to separate single colonies after overnight growth (16h) at 37°C.

A single colony of CGSC 6300 (pICI 1386) was removed and resuspended in 10ml L-tetracycline broth and 100µl immediately inoculated into each of twenty 250ml Erlenmeyer flasks containing 75ml of L-tetracycline broth. After growth for 16h at 37°C on a reciprocating shaker the contents of the flasks were pooled, and used to inoculate a fermenter containing 20 litres of modified LCM50 growth medium. The composition of the growth medium is in Table 1.






TABLE 1: Composition of growth medium
Modified LCM50 Growth Medium (A)

made up with distilled water              g/l

KH₂PO₄                                    3.0
Na₂HPO₄                                   6.0
NaCl                                      0.5
Casein Hydrolysate (Oxoid L41)            2.0
(NH₄)₂SO₄                                 10.0
Yeast extract (Difco)                     20.0
Glycerol                                  35.0
MgSO₄.7H₂O                                0.5
CaCl₂.2H₂O                                0.03
Thiamine                                  0.008
FeSO₄/Citric acid                         0.04/0.02
Trace element solution (TES)              (0.5 ml l⁻¹)
Tetracycline                              (10 mg l⁻¹)



The fermentation was then carried out at a temperature of 37°C and at a pH, controlled by automatic addition of 6M sodium hydroxide solution, of pH 6.7. The dissolved oxygen tension (dOT) set point was 50% air saturation and was initially controlled by automatic adjustment of the fermenter stirrer speed. Air flow to the fermenter was initially 20 L/min corresponding to 1.0 volume per volume per minute (VVM) and was increased to 45 L/min manually when the fermenter stirrer speed reached its maximum (1000 rpm). The fermentation was performed for 16h and during that time samples were taken for measurement of optical density of the culture (OD550), biomass concentration, total microbial protein concentration and accumulation of [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]human G-CSF within the bacterial cells. Accumulation was measured by scanning Coomassie blue stained SDS-PAGE gels of whole cell lysates of the sampled bacteria as is well known in the art. Total microbial protein was estimated by the method of Lowry. A solution of yeast extract (225 g/L) was pumped into the fermenter 4.5h post inoculation at 1.7 g/L/h.

When the supply of carbon source (glycerol) in the growth medium became exhausted dOT increased rapidly from 50% air saturation. At this point a feed containing glycerol (714 g/l) and ammonium sulphate (143 g/l) was pumped. Since the bacterial oxygen uptake rate (OUR) approached the maximum oxygen transfer rate of the fermenter (OTR) just prior to the carbon source in the batch growth medium becoming exhausted, the feed was pumped into the fermenter at a rate which restricted the bacterial OUR to approximately 80-90% of the fermenter's maximum OTR. The feed rate was adjusted manually to return and then maintain dOT at 50% air saturation under the conditions described.

c) The fermentation process described in (b) was repeated but at a temperature of 30°C for 35 hours. Except for the fermentation temperature of 30°C the medium and fermentation conditions were identical to those described in (b).

d) Purification was effected as described in Example 1(f).

Example 14 

Preparation of [Glu¹⁵,Ser¹⁷,²⁷,Ala²⁶,²⁸,Arg³⁰]hu G-CSF

A mutagenic template, M13mp18 containing the gene for [Glu¹⁵,Ser¹⁷,²⁷,Ala²⁶,²⁸,Lys³⁰]hu G-CSF, was prepared as described in part (d) of Example 1 with plasmid pICI 1266 replacing pICI 1080. The procedure described in Example 3 was repeated using the above template with mutagenic oligonucleotide designated SEQ ID No.37. This serves to convert the codon for Lys at position 30 to Arg. Double stranded RF DNA was prepared from one phage containing the desired change. An EcoRI-SalI expression cassette was isolated and cloned into pICI 0080 as described in Example 12 to give pICI 1343.

Further processing to yield the title compound was effected as described in Example 3 and purification was effected as described in Example 6.

Example 15 

Preparation of [Arg¹¹,²³,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF

A mutagenic template, M13mp18 containing the gene for [Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF, was prepared as described in part (d) of Example 1 with plasmid pICI 1239 replacing pICI 1080. The procedure described in Example 3 was repeated using the above template with mutagenic oligonucleotide designated SEQ ID No 38. This serves to convert the codon for Lys at position 23 to Arg. Double-stranded RF DNA was prepared from one phage containing the desired change and the expression cassette isolated and cloned as described in Example 14 to give pICI 1388.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.
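The bracketed names used for the derivatives can be generated from a list of substitutions, which makes it easier to follow how each example extends a parent derivative. The sketch below is a hypothetical helper, not part of the specification; the ordering rule (residues grouped by type, groups ordered by their lowest position) is inferred from the names that appear in this document.

```python
def derivative_name(substitutions, suffix="hu G-CSF"):
    """Build a name such as [Arg11,Ser17,27,60,65]hu G-CSF.

    substitutions maps a 1-based position to the three-letter code of the
    replacement residue.
    """
    groups = {}
    # Visiting positions in ascending order means each residue type is first
    # seen at its lowest position; dicts keep that insertion order.
    for position in sorted(substitutions):
        groups.setdefault(substitutions[position], []).append(position)
    parts = [
        residue + ",".join(str(p) for p in positions)
        for residue, positions in groups.items()
    ]
    return "[" + ",".join(parts) + "]" + suffix


# Parent derivative used as the template in this Example.
PARENT = {11: "Arg", 17: "Ser", 27: "Ser", 60: "Ser", 65: "Ser"}

if __name__ == "__main__":
    print(derivative_name(PARENT))
    print(derivative_name({**PARENT, 23: "Arg"}))  # Lys at position 23 changed to Arg
```

Adding the position 23 change to the parent yields `[Arg11,23,Ser17,27,60,65]hu G-CSF`, written here in plain ASCII rather than with superscripts.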

Example 16

Preparation of [Arg¹¹,³⁴,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF

The procedure described in Example 15 was repeated with oligonucleotide designated SEQ ID No.38 replaced by SEQ ID No.39 (this serves to convert the codon for Lys at position 34 to Arg) to give pICI 1389.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 17

Preparation of [Arg¹¹,⁴⁰,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF

The procedure described in Example 15 was repeated with oligonucleotide SEQ ID No.38 replaced by SEQ ID No.40 (this serves to convert the codon for Lys at position 40 to Arg) to give pICI 1390.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 18 

Preparation of [Ala¹,Thr³,Tyr⁴,Arg⁵,¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF

The procedure described in Example 15 was repeated with oligonucleotide SEQ ID No.38 replaced by SEQ ID No.41 (this serves to convert codons for Thr, Leu, Gly and Pro at positions 1, 3, 4 and 5 to Ala, Thr, Tyr and Arg respectively) to give pICI 1391.

The polypeptide of this Example illustrates that the modification of the present invention may be applied to a polypeptide known to possess G-CSF activity in order to improve the solution stability of the polypeptide. The known polypeptide is [Ala¹,Thr³,Tyr⁴,Arg⁵,Ser¹⁷]hu G-CSF which is described in European Patent Publication No. 272,703 of Kyowa Hakko Kogyo Co. Ltd.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 19 

Preparation of [Arg¹¹,Ser¹⁷,²⁷]hu G-CSF

The procedure described in Example 4 was repeated with oligonucleotide SEQ ID No.30 replaced by SEQ ID No.28 (this serves to convert the codon for Gln at position 11 to Arg). The expression cassette was transferred to expression plasmid pICI 0080, instead of pICI 0020, as described in Example 14 to give pICI 1405.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 20

Preparation of [Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF

The procedure described in Example 19 was repeated with oligonucleotide SEQ ID No.28 replaced by SEQ ID No.29 (this serves to convert the codons for Pro at 60 and 65 to Ser) to give pICI 1400.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 21

Preparation of [Arg¹¹,Ser¹⁷,²⁷,⁶⁰]hu G-CSF

The procedure described in Example 6 was repeated with oligonucleotides SEQ ID No.33 and SEQ ID No.34 replaced by SEQ ID No.28 and SEQ ID No.42. These serve to convert the codons for Gln at position 11 and Pro at position 60 to Arg and Ser respectively. The expression cassette was transferred to the expression plasmid pICI 0080 instead of pICI 0020 to give pICI 1401.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 22

Preparation of [Arg¹¹,Ser¹⁷,²⁷,⁶⁵]hu G-CSF

The procedure described in Example 3 was repeated with oligonucleotide designated SEQ ID No.29 replaced by SEQ ID No.43 (this serves to convert the codon for Pro at position 65 to Ser) to give pICI 1418.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 23

Preparation of [Ser¹⁷,²⁷,⁶⁰]hu G-CSF

The procedure described in Example 19 was repeated with oligonucleotide designated SEQ ID No.28 replaced by SEQ ID No.42 (this serves to convert the codon for Pro at position 60 to Ser) to give pICI 1402.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 24

Preparation of [Ser¹⁷,²⁷,⁶⁵]hu G-CSF

The procedure described in Example 4 was repeated with oligonucleotide designated SEQ ID No.30 replaced by SEQ ID No.43 (this serves to convert the codon for Pro at position 65 to Ser) to give pICI 1420.

Further processing to yield the title compound and the purification of the title compound were effected as described in Example 1.

Example 25 

Preparation of [Arg¹¹,Glu¹⁵,¹¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵,¹¹⁵,¹¹⁶,Ala²⁶,²⁸,Lys³⁰]hu G-CSF

Plasmid pICI 1348, described in Example 8, was digested with XbaI in buffer M and then with SalI in buffer H and the large XbaI-SalI vector fragment isolated from a 0.7% agarose gel as described previously. Plasmid pICI 1243, described in Example 4, was digested with XbaI and SalI as described above and the small XbaI-SalI fragment isolated from a 0.7% agarose gel and ligated to the XbaI-SalI vector fragment above. The ligation mix was used to transform E. coli strain MSD 522 and transformants selected for growth on L-agar plates containing ampicillin (50µg/ml). Three colonies were screened for expression of protein as described in Example 12 but replacing tetracycline by ampicillin at 50µg/ml. Plasmid DNA from a colony expressing the correct protein was designated pICI 1421.

Further processing to yield the title compound was effected as described in Example 3 and purification was effected as described in Example 6.

Example 26 

Preparation of [Arg¹¹,¹⁶⁵,Glu¹⁵,Ser¹⁷,²⁷,⁶⁰,⁶⁵,Ala²⁶,²⁸,Lys³⁰,⁵⁸]hu G-CSF

A mutagenic template, M13mp18 containing the gene for [Arg¹¹,Glu¹⁵,Ser¹⁷,²⁷,⁶⁰,⁶⁵,Ala²⁶,²⁸,Lys³⁰]hu G-CSF, was prepared as described in part (d) of Example 1 with plasmid pICI 1348 (described in Example 8) replacing pICI 1080. The procedure described in Example 3 was repeated using the above template with mutagenic oligonucleotides designated SEQ ID No.28 and SEQ ID No.29 replaced by SEQ ID No.44 and SEQ ID No.32 (these serve to convert the codons for Trp at position 58 to Lys and Tyr at position 165 to Arg) to give pICI 1422.

Further processing to yield the title compound was effected as described in Example 3 and purification was effected as described in Example 6.

Example 27 

Preparation of lAs&\G\u^SerW^JU&WM,si#> 9 Lys3o*M»]hu G-CSF 

A mutagenic template was prepared as described in Example 26. The procedure described in Example 4 was repeated using the above template with mutagenic oligonucleotide designated SEQ ID No.30 replaced by SEQ ID No.45 (this serves to convert the codons for Pro at position 44, Leu at position 49 and Gly at positions 51 and 55 to Ala, Lys, Ala and Ala respectively) to give pICI 1423.

Further processing to yield the title compound was effected as described in Example 3 and purification was effected as described in Example 6.

Example 28 

Preparation of [Argues GIu 1 ^" Ser".2W,«.ii5,n6 f Ala^-z^^.Lys^wjhu G-CSF 

A mutagenic template was prepared as described in part (d) of Example 1 with pICI 1080 replaced by pICI 1423, described in Example 27. The procedure described in Example 3 was repeated using the above template and oligonucleotides designated SEQ ID No.28 and SEQ ID No.29 replaced by SEQ ID No.32 and SEQ ID No.30 to give pICI 1424.

Further processing to yield the title compound was effected as described in Example 3 and purification was effected as described in Example 6.

Reference Example 1 

Preparation of human G-CSF 

a) Preparation of a synthetic gene for human G-CSF 

A DNA sequence (Figure 2) encoding the amino-acid sequence of the polypeptide of Figure 2 (human G- 
CSF) was designed according to the following considerations: 

1) Single-stranded cohesive termini to allow ligation at suitable sites in a plasmid.

2) A series of restriction endonuclease sequences throughout the gene to facilitate subsequent genetic manipulation.

3) Translation termination codon.

4) Codons at the 5'-end of the coding region were normally chosen to be A/T rich. Other codons were normally chosen as those preferred for expression in E. coli.

The gene was assembled from the 18 oligonucleotides designated SEQ ID No.1 - SEQ ID No.18 and shown hereinafter.

Preparation of Oligonucleotides 

The oligonucleotide sequences shown hereinafter were prepared on an Applied Biosystems 380A DNA synthesiser from 5'-dimethoxytrityl base-protected nucleoside-2-cyanoethyl-N,N-diisopropylphosphoramidites and protected nucleosides linked to controlled-pore glass supports on a 0.2 micro mol scale, according to protocols supplied by Applied Biosystems Inc.

Alternatively, the oligonucleotide sequences may be prepared by manual methods as described by Atkinson and Smith in 'Oligonucleotide Synthesis, a Practical Approach' (M. J. Gait, Editor, IRL Press, Oxford, Washington DC, pages 35-81).

In detail, the preparation of the oligonucleotide sequences by use of the Applied Biosystems 380A DNA synthesiser was effected as follows:-

Each oligonucleotide, after cleavage from the solid support and removal of all protecting groups, was dissolved in water (1ml). A solution of 3M sodium acetate (pH5.6; 40µl) and ethanol (1ml) was added to the oligonucleotide solutions (400µl) and the mixtures stored at -70°C for 20 hours. The resulting precipitates were collected by centrifugation (13,000rpm for 10 minutes) and the pellets washed with ethanol:water (7:3) (200µl) then dried briefly in vacuo and dissolved in water (15µl) and 10µl of a formamide/dye mix (10mM NaOH, 0.5mM EDTA, 0.01% Bromophenol Blue, 0.01% xylene cyanol, 80% formamide).

The oligonucleotides were purified on a 10% polyacrylamide gel in 50mM Tris-borate (pH8.3) containing 8.3M urea. Oligonucleotides of correct length were identified by UV shadowing (Narang et al, 1979 in Methods in Enzymology Vol 68, 90-98) - normally the most prominent band - excised from the gel and electroeluted in 5mM tris-borate (pH 8.3) at 300mV for 3-4 hours. The aqueous solutions were concentrated to about 200µl by treatment with n-butanol (mix, spin and removal of the upper organic layer). The purified oligonucleotides were precipitated at -70°C for 20 hours from a 0.3M sodium acetate solution by addition of ethanol (2.5 volumes).

Assembly of gene 

Oligonucleotides SEQ ID No 2 - SEQ ID No 17 (400pM of each) [as defined hereinafter] were phosphorylated with T4 polynucleotide kinase (3.6 units) for 2 hours at 37°C in 25µl of a solution containing ATP (800pM containing 25pM gamma-³²P ATP), 100µM spermidine, 20mM MgCl₂, 50mM Tris-HCl (pH9.0) and 0.1mM EDTA. The solutions were heated at 100°C for 5 minutes to terminate the reactions, then mixed in pairs as shown in Table 1 to give duplexes A to I (Oligonucleotides SEQ ID No 1 and SEQ ID No 18 (400pM in 25µl) were used unphosphorylated). 0.3M Sodium acetate (pH5.6, 200µl) and ethanol (850µl) were added and the duplexes precipitated at -20°C for 20 hours. The resulting precipitates were collected by centrifugation and washed with ethanol:water (7:3) then dissolved in water (50µl). The pairs of oligonucleotides were annealed together by first heating the solutions to 100°C for 2 minutes in a boiling water bath. The bath was then allowed to cool slowly to 40°C (about 4 hours). Solutions containing 3 pairs of duplexes were combined as shown (see Table 1), to give groups I to III, lyophilised and dissolved in 30µl of a solution containing T4 DNA ligase (1 unit; BRL), 50mM Tris (pH7.6), 10mM magnesium chloride, 5% (w/v) PEG 8000, 1mM ATP, 1mM DTT (BRL, Focus Vol 8 no 1 Winter 1986) and the DNA ligated at 30°C for 5 minutes followed by 20 hours at 16°C. 3M Sodium acetate (20µl) and water (150µl) were added and the product precipitated by addition of ethanol (750µl) and cooling to -20°C for 20 hours. The precipitate was collected by centrifugation and washed with ethanol (1ml) then dissolved in water (15µl) and formamide/dye mix (10µl) and purified on a 10% polyacrylamide gel in 50mM Tris-borate (pH8.3), 1mM EDTA and 8.3M urea. Bands for strands of appropriate lengths (173-186 bases) were identified by autoradiography and isolated together by electroelution from a single gel slice as described above for individual oligonucleotide sequences. The DNA strands were annealed by first heating an aqueous solution (50µl) at 100°C for 2 minutes, then allowing it to cool to 40°C over 4 hours.

Groups I, II and III were ligated together essentially as described for the group preparation to give as the product the gene sequence shown in Figure 2. After precipitation, the gene was phosphorylated with T4 polynucleotide kinase as described previously for individual oligonucleotides, then dissolved in water (20µl).






TABLE 1

DUPLEX    OLIGONUCLEOTIDE                  NUMBER OF BASES IN
                                           TOP STRAND    BOTTOM STRAND

A         SEQ ID No 1 + SEQ ID No 2        62            64
B         SEQ ID No 3 + SEQ ID No 4        60            60
C         SEQ ID No 5 + SEQ ID No 6        48            51
D         SEQ ID No 7 + SEQ ID No 8        63            60
E         SEQ ID No 9 + SEQ ID No 10       63            63
F         SEQ ID No 11 + SEQ ID No 12      60            63
G         SEQ ID No 13 + SEQ ID No 14      63            60
H         SEQ ID No 15 + SEQ ID No 16      60            60
I         SEQ ID No 17 + SEQ ID No 18      55            53

I         A + B + C                        170           175
II        D + E + F                        186           186
III       G + H + I                        178           173
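The strand lengths in the duplex table can be cross-checked against the group totals. In the sketch below the variable and function names are hypothetical, and the lengths for duplex A (62 and 64) are an inference from the group I totals rather than a direct reading of the table.

```python
# (top strand, bottom strand) lengths in bases for each duplex.
# Assumption: the duplex A values are inferred from the group I totals.
DUPLEXES = {
    "A": (62, 64), "B": (60, 60), "C": (48, 51),
    "D": (63, 60), "E": (63, 63), "F": (60, 63),
    "G": (63, 60), "H": (60, 60), "I": (55, 53),
}

GROUPS = {"I": ("A", "B", "C"), "II": ("D", "E", "F"), "III": ("G", "H", "I")}

# Group totals as listed in the table.
LISTED_TOTALS = {"I": (170, 175), "II": (186, 186), "III": (178, 173)}


def group_totals(members):
    """Sum the top and bottom strand lengths of the duplexes in a group."""
    top = sum(DUPLEXES[name][0] for name in members)
    bottom = sum(DUPLEXES[name][1] for name in members)
    return top, bottom


def gene_totals():
    """Total bases in the top and bottom strands of the assembled gene."""
    totals = [group_totals(members) for members in GROUPS.values()]
    return sum(t[0] for t in totals), sum(t[1] for t in totals)
```

With these values every group total matches the listed figure, and both strands of the assembled gene come to 534 bases.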



b) Cloning of the synthetic gene for human G-CSF

The synthetic gene described above was cloned into the plasmid vector pSTP1 (Windass et al, Nucleic Acids Research (1983) Vol 10, p6639).

For vector preparation, 10µg of pSTP1 was dissolved in water (37.5µl) and 10X B restriction buffer (4.5µl) (BCL). The restriction endonuclease SalI (3µl) (BCL, 8 units/µl) was added and the mixture incubated at 37°C for 1 hour until linearised plasmid was predominant over supercoiled and nicked circular forms. The DNA was precipitated with ethanol at 4°C for 30 minutes, washed with ethanol:water (7:3) then dissolved in water (39.5µl), 10X H buffer (4.5µl) (BCL). The restriction endonuclease EcoRI (1µl) (BCL, 90 units/µl) was added and the mixture incubated at 37°C for 1 hour until the large EcoRI-SalI fragment was predominant. The DNA was precipitated at -20°C for 20 hours, washed with ethanol:water (7:3) then dissolved in water (20µl).

The large EcoRI-SalI fragment was purified on a 1% preparative agarose gel and electroeluted and precipitated as described previously, then dissolved in water (20µl). For ligation of the synthetic gene, a mixture of vector DNA (2µl of the EcoRI-SalI fragment solution), synthetic gene (5µl of the aqueous solution described previously), 5X ligase buffer (6µl; 250mM Tris pH7.6, 50mM MgCl₂, 25% w/v PEG 8000, 5mM ATP, 5mM DTT; ex BRL), water (15µl) and T4 DNA ligase (2µl, 1U/µl) was incubated at 16°C for 4 hours. The DNA mix was used directly (either 1µl of neat ligation mix or 2µl of ligation mix diluted 5X with water) to transform E. coli strain HB101. The DNA mixture (1 or 2µl) was added to competent E. coli HB101 cells (20µl, BRL) on ice and the mixture incubated on ice for 45 min then heat shocked at 42°C for 45 seconds. After 2 min on ice, 100µl of SOC buffer (Bactotryptone 2%; Yeast Extract 0.5%; NaCl 10mM; KCl 2.5mM; MgCl₂, MgSO₄ 20mM (10mM each); glucose 20mM) was added and the mixture incubated at 37°C for 1 hour. Aliquots of suspensions were plated onto L plates with 50µg/ml ampicillin. Transformants were screened for the presence of cloned synthetic gene by colony hybridisation analysis using standard methods described in "Molecular Cloning: A Laboratory Manual" by Maniatis et al (Cold Spring Harbor) and in UK Patent Application No 8502605. A total of 100 colonies were streaked onto filters (Schleicher and Schuell), grown at 37°C for 20 hours, lysed and baked. The filter was hybridised at 65°C for 20 hours with a radioactive probe prepared from oligonucleotide sequence SEQ ID No 1 by use of a random-label kit (Pharmacia). Five colonies 1-5 giving a positive hybridisation signal were grown up in L broth at 37°C for 20 hours on a small scale (100ml) and plasmid DNA prepared by centrifugation in a caesium chloride gradient essentially as described in "Molecular Cloning: A Laboratory Manual" by Maniatis et al (Cold Spring Harbor).

The DNA was sequenced by the standard dideoxy chain-termination method as described by Sanger et al in Proc. Nat. Acad. Sci. USA 74, 5463-5467 (1977) using a Sequenase (Trade Mark) kit (United States Biochemical Corporation). Oligonucleotides SEQ ID No 19 to SEQ ID No 23 (as defined hereinafter and see Table 2) were used as sequencing primers.
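For the ligation of the synthetic gene into the pSTP1 vector fragment described above in part b), the component volumes and the composition of the 5X ligase buffer allow the final reaction conditions to be worked out. The following is a minimal sketch with hypothetical names; it assumes that the other components contribute nothing to the buffer concentrations.

```python
# Component volumes of the ligation mixture, in microlitres.
COMPONENT_VOLUMES_UL = {
    "vector DNA": 2.0,
    "synthetic gene": 5.0,
    "5X ligase buffer": 6.0,
    "water": 15.0,
    "T4 DNA ligase": 2.0,
}

# Composition of the 5X ligase buffer as quoted in the text.
BUFFER_5X = {
    "Tris pH7.6 (mM)": 250.0,
    "MgCl2 (mM)": 50.0,
    "PEG 8000 (% w/v)": 25.0,
    "ATP (mM)": 5.0,
    "DTT (mM)": 5.0,
}


def total_volume_ul(volumes):
    """Total reaction volume in microlitres."""
    return sum(volumes.values())


def final_concentrations(buffer_stock, volumes, buffer_key="5X ligase buffer"):
    """Concentration of each buffer component after dilution into the reaction."""
    total = total_volume_ul(volumes)
    return {
        name: stock * volumes[buffer_key] / total
        for name, stock in buffer_stock.items()
    }
```

The volumes add up to 30µl, so the 6µl of 5X buffer is diluted exactly five-fold, giving 50mM Tris, 10mM MgCl₂, 5% PEG 8000, 1mM ATP and 1mM DTT. These are the same conditions quoted for the ligation of the oligonucleotide groups during gene assembly.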

TABLE 2

CODE             PRIMING SITE

SEQ ID No 19     214-234 top strand
SEQ ID No 20     333-353 top strand
SEQ ID No 21     375-395 bottom strand
SEQ ID No 22     207-227 bottom strand
SEQ ID No 23     69-93 bottom strand



The plasmid DNA from clone 5 contained the DNA sequence shown in Figure 2. The plasmid (pAG88) was used to transform competent cells of the following E. coli strains by standard procedures:-
HB101
CGSC 6300 (hereinafter also referred to as MSD 522)

The E. coli strains HB101 and MSD522 (CGSC 6300) are freely available. Thus for example they may be obtained from the E. coli Genetic Stock Centre, Yale University, USA. Moreover E. coli HB101 may additionally be obtained from for example BRL supplied by GIBCO Limited, Unit 4, Cowley Mill Trading Estate, Longbridge Way, Uxbridge, UB8 2YG, Middlesex, England or GIBCO Laboratories, Life Technologies Inc., 3175 Staley Road, Grand Island, NY 14072, USA. The genotype of strain HB101 is described in the aforementioned "Molecular Cloning - A Laboratory Manual" as Sup E44 hsd S20 (rB⁻ mB⁻) rec A 13 ara-14 F⁻ leu 6 thi-1 proA2 lac Y1 gal K2 rps L20 xyl-5 mtl-1. The genotype of MSD 522 (CGSC 6300) is set out in Example 13.

c) Cloning of the gene for human G-CSF into an expression vector

The gene described above was cloned in the plasmid pICI 0020 as described in Example 1(c) to yield
the expression plasmid pICI 1056.

d) Fermentation

The plasmid pICI 1056 was transformed and fermentation effected as described in Example 1(e) to achieve
expression of human G-CSF.

e) Purification

Purification was effected as described in the second purification procedure developed to yield larger quantities
of hu G-CSF set out on pages 48 and 49 of PCT Patent Publication No. WO 87/01132 with final dialysis
being effected against phosphate buffered saline.

Reference Example 2 

Preparation of genes for derivatives of human G-CSF by site-directed mutagenesis 

The phosphorothioate method of Eckstein and co-workers was used:
Taylor, J W et al Nucleic Acids Research (1985) Vol pp 8749-8764
Taylor, J W et al Nucleic Acids Research (1985) Vol pp 8765-8785
Nakamaye, K et al Nucleic Acids Research (1986) Vol pp 9679-9698
Sayers, J R et al Nucleic Acids Research (1988) Vol pp 791-802
The procedure can be carried out using a kit supplied by Amersham International. The method is outlined below
and incorporates changes to the original method with regard to the use of more than one mutagenic oligonucleotide
and the incubation temperature for oligonucleotides of greater than 30 bases in length.

1. Annealing mutant oligonucleotide to single stranded DNA template:

Single stranded DNA template (1µg/µl)                     5µl
Phosphorylated mutagenic oligonucleotide (1.6pmol/1µl)    2.5µl
Buffer 1                                                  3.5µl
Water                                                     6µl

(Where two mutagenic oligonucleotides were used simultaneously, 2.5µl (1.6pmol/1µl) of each phosphorylated
oligonucleotide was added to 5µl single stranded DNA template (1µg/µl) in 3.5µl Buffer 1 and 3.5µl water. Where
3 mutagenic oligonucleotides were used 2.5µl (1.6pmol/µl) of each phosphorylated oligonucleotide was added
to 5µl single stranded DNA (1µg/µl) in 3.5µl Buffer 1 and 1µl water.) The above ingredients were placed in a
capped tube in a 70°C water bath for 3 minutes if the oligonucleotide was <30 bases in length or in a boiling
water bath for 3 minutes if the oligonucleotide was >30 bases in length. The tube was then placed in a 37°C
water bath for 30 minutes.
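
The three annealing recipes above all come to the same total volume (17µl), with water reduced by 2.5µl for each extra oligonucleotide. The following is a minimal sketch of that arithmetic only; the function name `annealing_mix` and the dictionary keys are illustrative assumptions and are not part of the described procedure.

```python
# Volumes in microlitres, taken from the single-oligonucleotide recipe above.
TEMPLATE_UL = 5.0   # single stranded DNA template
OLIGO_UL = 2.5      # each phosphorylated mutagenic oligonucleotide
BUFFER1_UL = 3.5    # Buffer 1
TOTAL_UL = 17.0     # 5 + 2.5 + 3.5 + 6 in the single-oligonucleotide case


def annealing_mix(n_oligos):
    """Return component volumes (µl) for 1, 2 or 3 mutagenic oligonucleotides.

    Hypothetical helper: water is whatever is left after the fixed components,
    so that the total volume stays at 17 µl.
    """
    if n_oligos not in (1, 2, 3):
        raise ValueError("only 1, 2 or 3 oligonucleotides are described")
    oligos = n_oligos * OLIGO_UL
    water = TOTAL_UL - TEMPLATE_UL - BUFFER1_UL - oligos
    return {
        "template": TEMPLATE_UL,
        "oligonucleotides": oligos,
        "buffer_1": BUFFER1_UL,
        "water": water,
    }


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, annealing_mix(n))
```

The water volumes this gives for one, two and three oligonucleotides (6µl, 3.5µl and 1µl) agree with the figures stated in the text.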

2. Synthesis and ligation of mutant DNA strand:
To the annealing reaction were added

MgCl2 solution               5µl
Nucleotide mix 1             19µl
(contains dCTP alpha S)
Water                        6µl
Klenow fragment (6 units)    1.5µl
T4 DNA ligase (5 units)      2µl

The above ingredients were placed in a 16°C water-bath and left overnight.

3. Removal of single stranded (non-mutant) DNA using disposable centrifugal filter units.
To the reaction from Step 2 the following ingredients were added:-

Water      170µl
5M NaCl    30µl

The 250µl sample was added to the top half of the filter unit and centrifuged at 1500 rpm for 10 minutes
at room temperature in a SORVALL RT6000B bench top centrifuge using a SORVALL H1000B swing out
rotor. The sample passes through two nitrocellulose membranes which bind the single stranded DNA leaving the
double stranded DNA to pass through to the collection tube below.

100µl of 500 mM NaCl were added and respun for 10 minutes to wash through any remaining RF DNA.

The following ingredients were added to the filtrate:-
3M Sodium Acetate (pH6.0)    28µl
Cold Ethanol (-20°C)         700µl

The mixture was placed in a dry ice and ethanol bath for 20 minutes and centrifuged in an Eppendorf microfuge
for 15 minutes. The pellet was then resuspended in 10µl buffer 2.

4. Nicking of the non-mutant strand using Nci I.

To the reaction mix from step 3 was added 65µl Buffer 3 and 8 units Nci I (1µl). The mixture was placed in a
37°C water bath for 90 minutes.

5. Digestion of non-mutant strand using exonuclease III
To the reaction mix from step 4 was added

500 mM NaCl                    12µl
Buffer 4                       10µl
Exonuclease III (50 units)     2µl

The mixture was placed in a 37°C water bath and incubated for 30 minutes (50 units of exonuclease III
will digest approximately 3,000 bases in 30 minutes). The mixture was then placed in a 70°C water
bath for 15 minutes to inactivate the enzymes.

6. Repolymerisation and ligation of the gapped DNA.
To the reaction mix from step 5 was added

Nucleotide mix 2               13µl
MgCl2 solution                 5µl
DNA polymerase I (4 units)     1µl
T4 DNA ligase (2.5 units)      1µl

The mixture was placed in a 16°C bath for 3 hours.

7. Transformation of competent host E. coli TG1 cells with the DNA:
300µl of freshly prepared competent E. coli TG1 cells (prepared following the method of Mandel and Higa)
were transformed with 20µl of the reaction mix from step 6 (in duplicate).

The transformants were plated out in a lawn of log phase TG1 cells in TY Top agar on TY plates and incubated
overnight at 37°C.

The E. coli strain TG1 is freely available from for example the E. coli Genetic Stock Centre, Yale University,
USA and from Amersham International plc, Amersham Place, Little Chalfont, Amersham, Buckinghamshire
HP7 9NA, England as supplied in their "in vitro" mutagenesis system, oligonucleotide directed kit (Product code
RPN1523).

Reference Example 3 

G-CSF Bioassay

A factor dependent cell line, Paterson - G-CSF (FDCP-G), obtained from the Paterson Institute, Manchester,
England was cloned by limiting dilution in the presence of G-CSF. A G-CSF responsive clone, designated
clone E7, was used to determine human recombinant G-CSF activity. 2.5x10* FDCP-G clone E7 cells in 100µl
of RPMI 1640 + 10% FCS were added to an equal volume of RPMI 1640 + 10% FCS containing G-CSF. Each
G-CSF sample was measured over 10 doubling dilutions. The final volume of RPMI 1640 (see Moore GE et
al (1967) JAMA, 199, 519) + 10% FCS (foetal calf serum) in each well of a 96-well microtitre plate was 200µl.
The microtitre plate was incubated at 37°C in 5% CO2 in a humidified incubator for 4 days. 1.0µCi of tritiated
thymidine was added per well and incubated over the final 6 hours. Cells were harvested onto glass fibre filter
papers and the level of radioactivity determined by liquid scintillation counting. The level of tritiated thymidine
incorporation was found to be directly proportional to the amount of G-CSF present. The FDCP-G clone E7
assay was calibrated using recombinant human G-CSF obtained from Amersham International with a declared
specific activity of 10⁶ units/mg of protein.

The potencies of G-CSF samples were determined by comparison to a standard of known activity.

The units of G-CSF activity per ml were calculated according to the following formula:-

Dilution of G-CSF standard          Dilution of sample                 Units/ml activity
giving 50% maximal increase    ÷    giving 50% maximal increase   ×    in G-CSF standard
in ³H-thymidine                     in ³H-thymidine
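
The calculation can be sketched as follows. This is an illustration only and rests on two assumptions that the text does not state: that each dilution is expressed as a fraction of the undiluted material (for example 1/1000), and that the first two terms are divided and the result multiplied by the activity of the standard. The function name and the example figures are hypothetical.

```python
def units_per_ml(standard_dilution, sample_dilution, standard_units_per_ml):
    """Estimate G-CSF activity (units/ml) of a sample from half-maximal dilutions.

    Assumption: both dilutions are fractions of the undiluted material
    (e.g. 1/1000), so a sample that has to be diluted further than the
    standard to reach 50% of the maximal response is the more potent one.
    """
    if standard_dilution <= 0 or sample_dilution <= 0:
        raise ValueError("dilutions must be positive fractions")
    return standard_dilution / sample_dilution * standard_units_per_ml


if __name__ == "__main__":
    # Hypothetical figures: the standard reaches the half-maximal response at
    # 1/100, the sample at 1/1000, and the standard contains 500 units/ml.
    print(units_per_ml(1 / 100, 1 / 1000, 500.0))
```

Under these assumptions a sample that needs ten times more dilution than the standard is assigned ten times the standard's activity.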



Reference Example 4 

Solution Stability of G-CSF and derivatives thereof 

Appropriate dilutions of the stock solution of G-CSF and derivatives in phosphate buffered saline (PBS) at
4°C described in Example 9 were tested for solution stability. Solutions of 1mg/ml, 5mg/ml and sometimes
10mg/ml of protein in PBS were incubated at 37°C for 14 days. Solutions were inspected visually at regular
intervals for signs of precipitation. After 14 days each solution was centrifuged at 14,000rpm for 20 minutes,
the supernatant removed by decantation and the pellet re-dissolved in PBS containing 1% w/v N-lauroyl sarcosine.
The total protein content in each supernatant and re-dissolved precipitate was estimated by A280 measurements
and the monomer content in each was estimated by reverse phase HPLC. These were expressed
as a percentage of the corresponding data given by solutions at the start of incubation and by a 1mg/ml solution
incubated at 4°C for 14 days. Variations between total protein and monomer estimates were observed only in
some of the re-dissolved pellets. The percentage protein remaining in solution in the supernatants from each
starting concentration is summarised in the Table.
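
As an illustration of how the percentage remaining in solution follows from the protein estimates, a minimal sketch is given below. The function names and the example readings are hypothetical; the 35% figure is the solution stability threshold at 5mg/ml quoted in the abstract, used here only as a default for the comparison.

```python
def percent_remaining(supernatant_protein, starting_protein):
    """Protein left in the supernatant as a percentage of the starting value.

    Both arguments must be in the same units (for example protein
    concentrations derived from absorbance readings).
    """
    if starting_protein <= 0:
        raise ValueError("starting protein must be positive")
    return 100.0 * supernatant_protein / starting_protein


def meets_stability_threshold(percent, threshold=35.0):
    """True if the percentage remaining reaches the threshold (default 35%)."""
    return percent >= threshold


if __name__ == "__main__":
    # Hypothetical example: 5 mg/ml at the start, 2 mg/ml left after 14 days.
    pct = percent_remaining(2.0, 5.0)
    print(pct, meets_stability_threshold(pct))
```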

The specific activity of the product in each supernatant after incubation was shown to be the same as in
the starting solution, and no differences were observed on PAGE-SDS under reducing or non-reducing conditions.

The following results were obtained:

G-CSF Derivatives                                             Spec. Act.     Solution Stability*
                                                              (U/mg x10⁹)    1mg/ml   5mg/ml   10mg/ml

[Met⁻¹]hu G-CSF                                               0.4            23       nd       nd
[Met⁻¹,Ser¹⁷]hu G-CSF                                         1.0            80       20       nd
[Met⁻¹,Ser¹⁷,²⁷]hu G-CSF                                      1.5            80       40       nd
[Met⁻¹,Arg¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF                          1.2            98       94       92
[…¹¹⁶]hu G-CSF                                                2.7            100      72       50
[Met⁻¹,Ser¹⁷,²⁷,Arg¹¹,¹⁶⁵,Lys⁵⁸]hu G-CSF                      1.2            92       77       47
[Met⁻¹,Glu¹⁵,Ser¹⁷,²⁷,Ala²⁶,²⁸,Lys³⁰]hu G-CSF                 1.0            100      100      94
[Met⁻¹,Ser¹⁷,²⁷,Lys⁴⁹,⁵⁸,Ala⁴⁴,⁵¹,⁵⁵]hu G-CSF                 1.0            84       69       44
[Met⁻¹,Arg¹¹,Glu¹⁵,Ser¹⁷,²⁷,⁶⁰,⁶⁵,
 Ala²⁶,²⁸,Lys³⁰]hu G-CSF                                      2.6            100      103      93
[Met⁻¹,Glu¹⁵,Ser¹⁷,²⁷,Ala²⁶,²⁸,Arg³⁰]hu G-CSF                 0.85           100      100      100
[Met⁻¹,…,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF                              2.5            100      98       88
[Met⁻¹,Arg¹¹,³⁴,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF                       1.4            105      92       80
[Met⁻¹,Arg¹¹,⁴⁰,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF                       1.3            108      100      87
[Met⁻¹,Ala¹,Thr³,Tyr⁴,Arg⁵,¹¹,
 Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF                                                     106      100      89
[Met⁻¹,Arg¹¹,Glu¹⁵,¹¹¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵,¹¹⁵,¹¹⁶,
 Ala²⁶,²⁸,Lys³⁰]hu G-CSF                                      0.5            100      100      100
[Met⁻¹,Arg¹¹,¹⁶⁵,Glu¹⁵,Ser¹⁷,²⁷,⁶⁰,⁶⁵,
 Ala²⁶,²⁸,Lys³⁰,⁵⁸]hu G-CSF                                   0.65           100      100      100
[Met⁻¹,Arg¹¹,Glu¹⁵,Ser¹⁷,²⁷,⁶⁰,⁶⁵,
 Ala²⁶,²⁸,⁴⁴,⁵¹,⁵⁵,Lys³⁰,⁴⁹,⁵⁸]hu G-CSF                       0.20           100      100      95
[Met⁻¹,Arg¹¹,¹⁶⁵,Glu¹⁵,¹¹¹,
 Ser¹⁷,²⁷,⁶⁰,⁶⁵,¹¹⁵,¹¹⁶,
 Ala²⁶,²⁸,⁴⁴,⁵¹,⁵⁵,Lys³⁰,⁴⁹,⁵⁸]hu G-CSF                       0.05           100      100      100
[Met⁻¹,Arg¹¹,Ser¹⁷,²⁷]hu G-CSF                                0.73           97       35       12
[Met⁻¹,Ser¹⁷,²⁷,⁶⁰,⁶⁵]hu G-CSF                                0.71           100      94       86
[Met⁻¹,Arg¹¹,Ser¹⁷,²⁷,⁶⁰]hu G-CSF                             0.81           99       65       32
[Met⁻¹,Arg¹¹,Ser¹⁷,²⁷,⁶⁵]hu G-CSF                             0.80           100      96       89
[Met⁻¹,Ser¹⁷,²⁷,⁶⁰]hu G-CSF                                   0.80           95       68       36
[Met⁻¹,Ser¹⁷,²⁷,⁶⁵]hu G-CSF                                   0.83           100      94       90

* percentage left in solution in PBS after 14 days at 37°C (established by UV; by HPLC available)

nd means not done

[Met⁻¹,Ser¹⁷]hu G-CSF may be obtained as described in Reference Example 5.

The above results demonstrate that modifications of the present invention improve solution stability without
loss of G-CSF activity, [Met⁻¹,Ser¹⁷]hu G-CSF in a concentration of 5mg/ml starting to precipitate out within 3
hours.

Reference Example 5

Preparation of [Ser¹⁷] hu G-CSF

The procedure described in Example 2 for the preparation of [Met⁻¹,Ser¹⁷,²⁷] hu G-CSF was repeated
except as follows:-

1) The duplex for phosphorylation was prepared from oligonucleotide sequences SEQ ID Nos 24, 25, 3
and 4, the sequences SEQ ID Nos 3 and 4 respectively replacing sequences SEQ ID Nos 26 and 27
employed in Examples 1 and 2.

2) The duplex referred to in (1) was phosphorylated with T4 polynucleotide kinase, but was digested with
SnaBI (10 units) in 1 x M buffer (BCL; 30µl) for 2 hours at 37°C.

3) Following purification with ethanol, the 72bp EcoRI-SnaBI fragment was purified as opposed to the 143
bp EcoRI-MstII fragment.

4) The synthetic EcoRI-SnaBI fragment was cloned into the plasmid vector pAG88 as described in Reference
Example 1 and for vector preparation pAG88 was digested with SnaBI (20 units; BCL) in 1 x M buffer
(BCL; 100µl) for 2 hours at 37°C instead of MstII in 1 x H buffer.

5) Following precipitation with ethanol, the large EcoRI-SnaBI fragment was purified on a 1% agarose gel
as opposed to the large EcoRI-MstII fragment.

6) The plasmid containing the gene for [Ser¹⁷] hu G-CSF was designated pICI 1105.

Reference Example 6 

Construction of pICI 0080

a) Construction of pTB357 (also referred to herein as pLB 004)

Plasmid pTB357 utilises a repressed tetracycline resistance determinant, as found on the naturally-occurring
plasmid RP4. This repressed system shuts off expression of the tetA gene in the absence of tetracycline
whereas most drug resistant mechanisms have constitutive expression.

The tet locus was first mapped on RP4 by Barth and Grinter (J. Mol. Biol. 113: 455-474, 1977). This was
shown to consist of adjacent genes: tetA, the structural resistance gene and tetR, the repressor gene and this
region has been sequenced (Klock et al, J. Bacteriol. 161: 326-332, 1985). These genes are located on adjacent
BglII-SmaI and SmaI-SmaI fragments. The BglII site is unique in RP4 but there are five SmaI sites (Lanka,
Lurz and Furste, Plasmid 10: 303-307, 1983).

i) Cloning the tetA + tetR genes

The plasmid RP4 is well documented (Datta et al, J. Bacteriol. 108: 1244, 1971) and is freely available.
Furthermore the plasmid RP4 has been deposited with the National Collection of Type Cultures, 61 Colindale
Avenue, London, NW9 5HT under accession nos. 50078 and 50437. E. coli strains containing this plasmid
were grown in selective broth cultures and plasmid DNA was isolated by a scale-up of the Holmes and Quigley
method (Holmes and Quigley, Anal. Biochem. 114: 193-197, 1981). It was deproteinized by treatment with 2.5M
ammonium acetate and reprecipitated with isopropanol. This plasmid DNA was treated, according to the supplier's
recommended conditions, with restriction enzyme BglII and cut to completion. It was then partially cut
by XmaI by using diluted enzyme and short incubation times. XmaI is an isoschizomer of SmaI but one which
produces 4-nucleotide cohesive ends at its cut sites.

The vector plasmid pUC8 (Yanisch-Perron, Vieira and Messing, Gene 33: 103-119, 1985) was similarly
prepared and cut with BamHI and XmaI to completion. The RP4 fragments were cloned into this vector by ligation
with T4 ligase at 12°C for 16 hours. This was used to transform E. coli C600 made competent by the calcium
chloride method (Maniatis et al, Cold Spring Harbor Laboratory, 1982). Cultures were then plated onto
medium which selected for tetracycline resistance.

E. coli C600 is freely available from numerous sources including many culture collections such as the
E. coli Genetic Stock Centre, Yale University, USA under accession No CGSC 3004. The genotype of E. coli
C600 is K12 thr-1 leuB6 thi-1 hsdS1 lacY1 tonA21 λ- supE44.

Several colonies with this resistance were checked for the expected phenotype (ampicillin and tetracycline
resistance but not the kanamycin resistance indicative of RP4 itself). Colonies with the correct resistances were
subjected to clone analysis by isolating plasmid DNA (Holmes and Quigley method). These preparations were
cut with EcoRI and HindIII and analysed by gel electrophoresis. This established the size of the cloned insert
which was found to be the 2.45 kb predicted for the BglII - XmaI - XmaI fragment from RP4. A clone carrying
this fragment containing the tetA and tetR genes was designated pTB344.

ii) Removal of the tet gene from pAT153

It was necessary to remove the tet gene from the vector plasmid pAT153 before inserting the tetA + tetR
cassette from RP4 to prevent gene duplication which can be a source of genetic instability. Also the tet gene
may not be effectively suppressed by the non-cognate tetR. The removal was done by isolating plasmid pAT153
DNA and cutting it with EcoRI and AvaI. Between these sites, synthetic oligonucleotides with the sequence SEQ
ID No.59:-

5' AATTCGCATGCGGATCCATCGATC 3'
3'     GCGTACGCCTAGGTAGCTAGAGCC 5'

were cloned. These fit the EcoRI and AvaI cohesive ends and contain SphI, BamHI and ClaI sites in addition.
After transformation and selection, colonies were tested for the loss of the tetracycline resistance determinant.
Plasmid DNA from one clone was sequenced to confirm that the predicted sequence was correct. This plasmid
was designated pCH19.
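
The two strands of SEQ ID No. 59 as printed can be checked for base pairing and for the three additional sites with a few lines of code. This is a sketch for illustration only, not part of the described method; the variable names are assumptions, the 4-base single-stranded ends are inferred from the printed sequences, and the recognition sequences used are the standard ones for SphI (GCATGC), BamHI (GGATCC) and ClaI (ATCGAT).

```python
# Strands of SEQ ID No. 59 as printed in the text.
TOP_5_TO_3 = "AATTCGCATGCGGATCCATCGATC"
BOTTOM_3_TO_5 = "GCGTACGCCTAGGTAGCTAGAGCC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")
OVERHANG = 4  # single-stranded bases at each end of the annealed linker


def complement(seq):
    """Base-by-base complement (no reversal), for strands written antiparallel."""
    return seq.translate(COMPLEMENT)


# The top strand, minus its 5' overhang, should pair with the bottom strand,
# minus the overhang at the other end.
paired_top = TOP_5_TO_3[OVERHANG:]
paired_bottom = BOTTOM_3_TO_5[:-OVERHANG]
strands_pair = complement(paired_top) == paired_bottom

# Standard recognition sequences of the sites named in the text.
SITES = {"SphI": "GCATGC", "BamHI": "GGATCC", "ClaI": "ATCGAT"}
site_positions = {name: TOP_5_TO_3.find(site) for name, site in SITES.items()}

if __name__ == "__main__":
    print("strands pair:", strands_pair)
    print("site positions (0-based, top strand):", site_positions)
```

A position of -1 would mean that a site is absent from the top strand; all three sites are found, in the order SphI, BamHI, ClaI.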

iii) Insertion of the tetA + tetR genes

The tetA and tetR genes were isolated from pTB344 on an EcoRI to PstI fragment. The pUC8 vector was
destroyed by cutting with SspI because it carries the same selection determinant (ampicillin resistance) as
pCH19. Plasmid pCH19 DNA was cut with EcoRI and PstI and then ligated with the 2.45 kb fragment carrying
the tet genes. This was used to transform E. coli C600, the culture being plated out under selection for tetracycline
resistant colonies. The insertion of the tet genes was designed to replace most of the bla genes in
pCH19 which should thus lose its ampicillin resistance determinant. Loss of ampicillin resistance from the transformants
was confirmed. A few clones were then used to isolate plasmid DNA which was subjected to restriction
analysis. This confirmed that the constructed plasmid had the intended structure. It was designated pTB351.

iv) Insertion of the cer sequence

The naturally-occurring plasmid ColE1 is very stably maintained in E. coli, whereas its derivatives pBR322
and pAT153 are not. Summers and Sherratt (Cell 36: 1097-1103, 1984) demonstrated that this was due to
the derivatives not containing a short (283 bp) sequence called cer which is present in the parent plasmid. This
sequence contains a site-specific plasmid multimer-resolution system which prevents the accumulation of plasmid
multimers formed by homologous recombination. Such multimers have a deleterious effect on the process
of partition which normally ensures stable inheritance of daughter plasmids during bacterial cell division.

The cer sequence (Summers, D et al MGG, 201, p334-338, 1985) was isolated from plasmid pKS492 (provided
by D. Sherratt) as a 289 bp fragment by cutting with BamHI and TaqI. The plasmid pTB351 was isolated
as DNA from a dam strain of E. coli to prevent its ClaI site being blocked by the dam+ methylation system. This
DNA was cut with BamHI and ClaI (both these sites having been introduced on the synthetic oligonucleotide
for this cloning). The cer fragment was ligated with the cut vector and then used to transform E. coli C600,
selection being made for tetracycline resistance. Transformant colonies were subjected to clone analysis by
AvaI restriction and gel electrophoresis. The presence of an extra DNA band of about 300 bp indicated the
acquisition of the cer fragment. Further restriction analyses were used to confirm that resultant plasmids had
the correct structure. One of these was designated pTB357 (Figure 5) and also designated pLB004.

b) Plasmid pCH101

The plasmid pCH101 corresponds to pICI 0020 (see Example 1c) except that the EcoRI-SalI fragment
(see Figure 1) is replaced by a fragment consisting of the SEQ ID No 53 (see Figure 6 also) and the interferon
α2 gene sequence as described by Edge M.D. et al, Nucleic Acids Research 1983, Vol 11, p6419-6435. In this
regard the 3'-terminal ATG codon of SEQ ID No 53 immediately precedes the TGT codon which codes for cysteine
(amino acid 1) in the interferon α2 sequence of the above-mentioned Edge M.D. et al Nucleic Acids
Research reference. The 5' nucleotide sequence GATCCATG and the complementary 3' nucleotide sequence
GTAC are thus omitted from the nucleotide sequence of the aforementioned reference.

c) Insertion of an Expression Cassette into pTB357

An expression cassette consisting of the trp promoter, a ribosome binding site and the interferon α2 gene
was isolated from plasmid pCH101 (see b above) on an EcoRI to SphI restriction fragment. This was ligated
into the production vector (pTB357) (see (a) above) similarly cut with EcoRI and SphI. This DNA was used to
transform a competent culture of E. coli C600 and tetracycline resistant colonies were isolated. A few of these
were tested by DNA clone analysis for the acquisition of the SstI restriction site carried on the expression cassette.
Clones positive in this respect were further tested by restriction mapping to check that the expected construct
was correct. They were also checked for the conferred capacity to produce interferon α2 protein as
analysed on a polyacrylamide-SDS gel stained with Coomassie blue. One such confirmed clone was designated
pLB005.

d) Insertion of T4 transcription terminator into pTB 244

The T4 transcription terminator sequence in the form of the SalI to HindIII fragment (67 base pairs long)
(see SEQ ID No. 51 and Figure 4a) was inserted into the multicloning site of an intermediate vector pTB 244
(described in European Patent Publication No. 237,269) between its SalI and HindIII sites. Clone analysis was
used to confirm the structure of this construct (pTB244.T4 ter). From this vector, an SstI to SphI fragment containing
most of the multicloning site and the T4 terminator was then isolated. This was inserted into pLB005
similarly cut with SstI and SphI thereby substituting the interferon α2 gene but leaving a cassette consisting of
the trp promoter, multicloning site and T4 terminator. This construct was confirmed by clone analysis and the
plasmid designated pLB013.

e) Substitution of the multicloning site

The multicloning site in pLB013 is not ideal for this vector in several respects: the SalI, BamHI and SmaI
sites are not unique but exist elsewhere on the plasmid. This fragment was therefore excised by cutting with
SstI and XbaI (both unique) and synthetic oligonucleotides with the sequence of SEQ ID No. 54:-

5' AGCTCCATATGGTACCAGATCTCTCGAGAGTACTT 3'
3'     GGTATACCATGGTCTAGAGAGCTCTCATGAAGATC 5'

were inserted in its place. Clones were analysed for acquisition of the new restriction sites and then confirmed
by sequencing. One such plasmid was designated pLB014. The new cloning sites inserted in this way are:
NdeI, KpnI, BglII, XhoI and ScaI with the previous XbaI and SalI following them.
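
The order of the new cloning sites can be read directly off the top strand of SEQ ID No. 54. The sketch below is illustrative only; the names are assumptions, and the recognition sequences are the standard ones for NdeI (CATATG), KpnI (GGTACC), BglII (AGATCT), XhoI (CTCGAG) and ScaI (AGTACT).

```python
# Top strand of SEQ ID No. 54 as printed in the text (5' to 3').
TOP_STRAND = "AGCTCCATATGGTACCAGATCTCTCGAGAGTACTT"

# Standard recognition sequences for the sites named in the text.
RECOGNITION = {
    "NdeI": "CATATG",
    "KpnI": "GGTACC",
    "BglII": "AGATCT",
    "XhoI": "CTCGAG",
    "ScaI": "AGTACT",
}


def site_order(sequence, recognition):
    """Return (enzyme, 0-based position) pairs sorted by position in sequence."""
    hits = []
    for enzyme, site in recognition.items():
        position = sequence.find(site)
        if position >= 0:  # skip enzymes whose site is absent
            hits.append((enzyme, position))
    return sorted(hits, key=lambda hit: hit[1])


if __name__ == "__main__":
    for enzyme, position in site_order(TOP_STRAND, RECOGNITION):
        print(enzyme, position)
```

The sorted result lists the sites in the same order as the text: NdeI, KpnI, BglII, XhoI, ScaI. The NdeI and KpnI sites share one base (the final G of CATATG is the first G of GGTACC).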

f) Further modification

It was discovered that the adjacent SstI and NdeI sites in pLB014 could not be cut by both these restriction
enzymes either simultaneously or sequentially presumably because of their close proximity. An additional sequence
was therefore inserted between them. This was done by cutting pLB014 with SstI and KpnI and then
inserting the synthetic oligonucleotide of SEQ ID No. 55.

5' AGCTCAGCTGCAGCATATGGTAC 3'
3'     GTCGACGTCGTATAC 5'

Clones were analysed for acquisition of an extra PvuII or PstI site and then confirmed by sequencing. One such
plasmid was designated pLB015 (=pICI 0080) (see Figure 7). This plasmid, unlike pLB014, is efficiently cut
by SstI and NdeI. This is to provide a place to insert a variety of ribosome binding site sequences correctly
positioned with respect to the upstream trp promoter and with NdeI designed to provide the ATG start codon
of the gene to be expressed.

Reference Example 7 

Construction of plasmid pICI 1295 (also referred to as pCG300)

a) Production of pCG54 from pICI1079

pICI1079 is an ampicillin resistant, pAT153-derived plasmid containing the following elements between
the EcoRI and StyI restriction sites:-

(i) a cI857 from phage λ;
(ii) a λPL promoter;
(iii) a synthetic ribosome binding site;
(iv) a synthetic interferon α2 gene sequence;
(v) a synthetic transcription terminator sequence, derived from phage T4, between the SalI and StyI restriction
sites. The DNA sequence of this transcription terminator is shown in Figure 4 and SEQ ID No. 56.

pICI1079 is illustrated in Figure 8.

pICI1079 has been deposited under the Budapest Treaty, at the National Collections of Industrial and
Marine Bacteria Limited (NCIMB), 23 St Machar Drive, Aberdeen, AB2 1RY, Scotland, UK. (NCIMB No 40370,
date of deposit 19 February 1991).

pCG54 was constructed in order to make available an expression vector containing the same promoter,
ribosome binding site and transcription terminator sequences as above, ie: λPL, RBS7 and T4, but lacking gene
sequence encoding for production of a specific protein. Such a construct would provide the facility of a basic
expression vector containing essential elements allowing transcription and translation for production of any protein
of interest which could be introduced into this vector by subsequent cloning events.

Construction of the vector was initiated by restriction endonuclease cleavage of pICI1079 at its respective
EcoRI and SalI sites. This cleavage step released a vector fragment containing the pICI1079 backbone complete
with genes for plasmid replication and antibiotic resistance functions, plus the T4 transcription terminator
sequence. The fragment was isolated by agarose gel purification steps using Geneclean for final purification
of the DNA fragment.

To this vector fragment a second smaller DNA fragment of approximately 1.2Kb in size was introduced.
This second fragment may be obtained, for example by DNA synthesis or by site directed or PCR mutagenesis
of the small EcoRI-SalI restriction fragment obtained from pICI1079 as described above. This second fragment
contained exactly equivalent promoter and ribosome binding site sequences as originally present in pICI1079
and additionally had EcoRI and SalI sites available at its 5' and 3' termini respectively, so providing compatible
termini for ligation to the pICI1079 fragment. A ligation reaction in the presence of Gibco-BRL enzyme T4 DNA
ligase and its respective buffer, resulted in the formation of the construct pCG54.

Clones containing this construct were originally isolated following transformation of an aliquot of the ligation
reaction mixture into E. coli competent cells of strain HB101.

The construct pCG54 recovered was 3.682Kb in size and contained essential features as outlined on the
map featured in Figure 9.

b) Production of pCG61 from pCG54 (also referred to as pICI54)

Synthetic oligonucleotide sequences were designed so as to include both the natural sequence for the
T7A3 promoter and also a sequence which would provide an effective translation initiation region to enable
correct processing of any polypeptide gene sequence cloned adjacent to it. A suitable candidate sequence for
this latter region was identified as RBS1, the trp ribosome binding sequence. Therefore two complementary
oligonucleotides identified as SEQ ID No.57 and SEQ ID No.58 were synthesized to generate a double stranded
DNA linker incorporating the T7A3 promoter and RBS1 sequences.

Oligonucleotides were prepared as 84mers by the standard protocol using an ABI gene synthesizer. They
were designed so that in the double stranded form the synthetic fragments would have restriction endonuclease
sites EcoRI and KpnI at the 5' and 3' ends respectively. Due to their length the oligomers could not be purified
by means of HPLC and purification was undertaken by means of acrylamide gel electrophoresis using a 10%
acrylamide: 7M Urea gel.

Prior to purification, the oligomers were first checked on a sizing gel to ensure that not only are they of the
correct size but that also the samples prepared contain as their greatest proportion the oligomers required and
not a high contaminating proportion of smaller secondary oligonucleotides which result as by-products of synthesis.

The acrylamide gels were prepared by standard methods with ammonium persulphate and N,N,N',N'-tetramethylethylenediamine
used as catalysts for gel polymerisation.

Sizing of the oligonucleotides required that they could be visualized after electrophoresis. It was therefore
necessary to radioactively label the samples using ³²P. This made it possible to assess sample quality following
electrophoresis by way of autoradiography.

Oligonucleotide samples were supplied in a crude form, unphosphorylated. This factor was made use of
for radiolabelling purposes in that the samples could be 'hot' labelled at the 5' termini by phosphorylation using
the enzyme T4 polynucleotide kinase.

Oligomers were provided from synthesis in an unphosphorylated form and so after purification each
oligomer was individually subjected to a phosphorylation reaction in which ATP was used to phosphorylate the
5' end of each molecule in the presence of T4 polynucleotide kinase (see Molecular Cloning: A Laboratory
Manual, 2nd Edition, Sambrook, Fritsch and Maniatis, p 5.68-5.71). Once phosphorylated the two complementary
oligonucleotides were annealed together to form the double strand DNA duplex containing the T7A3 promoter
and the RBS1 sequence.

The vector molecule pCG54 was cleaved with restriction enzymes EcoRI and KpnI. On restriction digestion
a 2.3kb vector fragment and a 1.1kb fragment containing the promoter and RBS1 sequence are generated.
This cloning step is planned to replace the λPL-RBS1 sequence by an EcoRI to KpnI synthetic fragment comprising
the T7A3-RBS1 sequence. The 2.3kb vector fragment resulting from digestion of pCG54 was purified by the
usual protocol using agarose gel electrophoresis and Geneclean methodology for removal of DNA from agarose
fragments.

6 The 84bp EcoRI-Kpnl synthetic fragment was ligated into the vector molecule prepared above and the 
ligated DNA used to transform E.coll HB101 ceils. Selection of positive recombinant clones was by ampicillin 
resistance. Following transformation a number of colonies containing recombinant plasmid were selected for 
screening purposes. 

The synthetic fragment incorporated into the vector during cloning was of a size (84 mer) such as to make 

10 restriction analysis of recombinant plasmid DNA samples inappropriate as a simple screening method. Inserts 
of such a small size are not readily apparent on agarose gel electrophoresis. The fragment itself contains no 
internal restriction endonuclease cleavage site which could be diagnostic of its presence. Initial screening of 
recombinant clones was therefore by the method of colony hybridisation (see Grunstein and Hogness Proc. 
Natl Acad. Set 72, 3961 (1975)). Nitrocellulose filters containing immobilized plasmid DNA from the recombinant 

15 clones were hybridised against a probe prepared by random radiolabelling of the synthetic annealed oligonuc- 
leotide SEQ ID No. 57 and SEQ ID No.58 .The DNA was labelled using cW-dCTP and incubation with Klenow 
polymerase at 37°C for 2 hours. Recombinant colonies which generated a positive hybridisation reaction were 
selected for plasmid DNA preparation. Plasmid DNA was prepared in each case by a relatively large scale 
method incorporating CsCI gradient density centrifugatlon to ensure purity see " Molecular Cloning - A laborat- 

20 ory manual "second edition, Sambrook Fritsch and Ma niatis (Cold Spring Harbor Laboratory, 1 989) p1 .42-1 .52. 
Preparation of DNA by such a method ensures high quality material suitable for use in subsequent cloning mani- 
pulations and sequence analysis. 

All plasmid DNA isolated from recombinant clones was included in a secondary screen by sequence
analysis, to ensure that the oligonucleotide sequence at the cloning junctions and of the T7A3-RBS1 fragment
itself was absolutely correct. The sequencing protocol used was that of Sequenase and the sequencing primer
selected for use was, for example, pBR322 UP (pBR322 universal primer). Sequencing was effected using the
Sanger dideoxy chain termination sequencing technique.

Clones having the correct sequence were designated as the new expression construct pCG61, and contained 
the T7A3 promoter, RBS1 sequence and the T4 terminator sequence (see Figure 10). 
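
As a cross-check on the annealed oligonucleotides used as the hybridisation probe, the following minimal sketch (in Python, which is not part of the original disclosure) tests whether SEQ ID No 58 is fully complementary to an internal stretch of SEQ ID No 57 and reports the single-stranded ends left on the duplex. The function names are illustrative assumptions; the two sequences are copied from the sequence listing.

```python
# Sequences copied from the sequence listing (spaces removed after joining).
SEQ_57 = (
    "AAT TCA ACA AAA CGG TTG ACA ACA TGA AGT AAA CAC GGT ACG ATG "
    "TAC CAC AAG TTC ACG TAA AAA GGG TAT CGA CAA TGG TAC"
).replace(" ", "")
SEQ_58 = (
    "CAT TGT CGA TAC CCT TTT TAC GTG AAC TTG TGG TAC ATC GTA CCG "
    "TGT TTA CTT CAT GTT GTC AAC CGT TTT GTT G"
).replace(" ", "")

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def single_stranded_ends(top: str, bottom: str) -> tuple[str, str]:
    """Return the (5' end, 3' end) of `top` left unpaired when `bottom` anneals.

    Hypothetical helper: it assumes `bottom` pairs with one contiguous
    internal stretch of `top` and raises ValueError otherwise.
    """
    paired = reverse_complement(bottom)
    start = top.find(paired)
    if start < 0:
        raise ValueError("strands are not fully complementary")
    return top[:start], top[start + len(paired):]


if __name__ == "__main__":
    print(len(SEQ_57), len(SEQ_58))
    print(single_stranded_ends(SEQ_57, SEQ_58))
```

Under these assumptions the 84 mer pairs with the whole of the 76 mer, leaving AATT unpaired at its 5' end and GTAC unpaired at its 3' end. Reading these as EcoRI-compatible and KpnI-compatible ends is an inference from the sequences, not a statement taken from the text above.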

c) Production of pCG300 (also referred to as pICI 1295) from pCG61

The sequence and synthesis steps involved in construction of the G-CSF analogue [Ser17,27]hu G-CSF are
as described in Example 1 (see Figure 3). This G-CSF analogue sequence was isolated from a construct in
which the gene had been incorporated into the plasmid pSTP1 to give pICI 1107 (see Example 2). pICI 1107
was digested with ScaI and the large fragment isolated following agarose gel electrophoresis and Geneclean
purification. This fragment was then digested with the restriction endonuclease SalI to generate a [Ser17,27]hu
G-CSF gene on a ScaI to SalI restriction fragment suitable for cloning into pCG61 (see Figure 10).

Following restriction with SalI the required fragment was isolated using agarose gel purification techniques
once again.

The vector molecule pCG61 was digested with restriction enzyme KpnI. Cleavage with this enzyme creates
a 3' overhang which was then blunt-ended using the enzyme T4 polymerase (see "Molecular Cloning - A Laboratory
Manual", Second Edition, Sambrook, Fritsch and Maniatis, p5.44-5.47). T4 polymerase activity was heat
inactivated by incubation at 70°C for 30 minutes and the DNA was recovered by ethanol precipitation. The
pellet was dissolved in sterile distilled water and the solubilized DNA cleaved with SalI. The KpnI (now
blunt-ended) to SalI vector fragment was recovered by means of ethanol precipitation followed by agarose gel
electrophoresis and purification techniques.

The ScaI to SalI [Ser17,27]hu G-CSF fragment was then ligated into the blunt-ended KpnI to SalI vector.
Ligated DNA was transformed into E. coli strain HB101. Selection of recombinant clones was for ampicillin
resistance.
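
The blunt-end join described above can be sketched in a few lines of Python (not part of the original disclosure). The sketch models KpnI cleavage (GGTAC^C) followed by removal of the 3' overhang, blunt ScaI cleavage (AGT^ACT), and ligation of the two arms. The vector tail is taken from the 3' end of SEQ ID No 57, with one extra C assumed so that the KpnI site is complete; the gene start is taken from SEQ ID No 49. All function names are hypothetical.

```python
# Tail of the promoter/RBS fragment, from the 3' end of SEQ ID No 57.
# ASSUMPTION: the vector supplies the final "C" that completes GGTACC.
VECTOR_TAIL = "AAAAAGGGTATCGACAATGGTAC" + "C"

# First bases of the [Ser17,27]hu G-CSF gene, from SEQ ID No 49.
GENE_START = "AATTCAGTACTCCACTGGGTCCAGCAAGC"


def kpni_then_blunt(top: str) -> str:
    """Left arm (top strand) after KpnI cleavage and removal of the overhang.

    KpnI cuts GGTAC^C and leaves a 3' GTAC overhang; the 3'->5' exonuclease
    activity of T4 polymerase trims it back to the first G of the site.
    """
    site = top.index("GGTACC")
    return top[: site + 1]


def scai_right_arm(top: str) -> str:
    """Right arm (top strand) after blunt ScaI cleavage at AGT^ACT."""
    site = top.index("AGTACT")
    return top[site + 3:]


def ligate_blunt(left: str, right: str) -> str:
    """Join two blunt-ended arms, top strand only."""
    return left + right


junction = ligate_blunt(kpni_then_blunt(VECTOR_TAIL), scai_right_arm(GENE_START))

if __name__ == "__main__":
    print(junction)
```

With these inputs the trimmed vector arm ends in ATG and the gene arm begins ACT CCA CTG (Thr Pro Leu), so the sketch places an ATG codon directly in front of the first codon of the gene. This illustrates the geometry of the join only and is not a substitute for the sequence analysis described in the text.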

Initial screening of potential recombinant clones was by means of hybridisation. A radiolabelled probe was
prepared by random labelling of an EcoRI to SalI fragment (containing the [Ser17,27]hu G-CSF gene sequence)
prepared from the plasmid pICI 1107. This was used in hybridisation against colonies whose DNA had been
immobilized onto the surface of nitrocellulose filters. Subsequently plasmid DNA was prepared from 24 clones
which had hybridised in this screen. All DNA preparations were by the rapid mini-prep method (see
Birnboim and Doly, Nucleic Acids Research, 7, 1513, 1979). These recombinant DNA preparations were
subjected to a secondary screen by way of restriction analysis. Linearization of the DNA with BamHI, which is a
unique site within the expression cassette, is indicative of the presence of the [Ser17,27]hu G-CSF sequence.

Sequence analysis was performed to confirm the presence of the [Ser17,27]hu G-CSF gene and to verify
that the base sequence at the cloning junctions and throughout the [Ser17,27]hu G-CSF gene was correct. For
this purpose large scale plasmid DNA samples were prepared from 16 recombinant clones using the CsCl
gradient density centrifugation technique to ensure purity. Sequencing steps were performed in accordance with
the Sequenase protocol and the sequencing primer selected was the pBR322 universal primer (EcoRI). Two of
the recombinant clones contained the correct sequence at the ScaI end of the [Ser17,27]hu G-CSF fragment and
throughout the G-CSF peptide sequence itself. The clones were identified as expression construct pCG300
(see Figure 12).
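
The kind of comparison that such sequence confirmation relies on can be illustrated with a short Python sketch (not part of the original disclosure). It compares the first 98 bases of SEQ ID No 48 (native gene) and SEQ ID No 49 ([Ser17,27] analogue) codon by codon and reports the positions that differ. The 8-base leader length and the small codon table, which covers only the four codons involved, are assumptions made for this illustration.

```python
# First 98 bases of SEQ ID No 48 (native) and SEQ ID No 49 (analogue),
# copied from the sequence listing.
NATIVE = (
    "AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG "
    "CTG AAG TGT CTC GAA CAG GTA CGT AAA ATT CAA GGC GAT GGT GCG GCT"
)
ANALOGUE = (
    "AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG "
    "CTG AAG TCT CTC GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT"
)

LEADER = 8  # ASSUMPTION: bases AATTCAGT precede codon 1 (ACT, Thr 1)

# Deliberately partial codon table: only the codons needed here.
CODON_TO_RESIDUE = {"TGT": "Cys", "TCT": "Ser", "GAT": "Asp", "AGC": "Ser"}


def codons(listing: str) -> list[str]:
    """Split a listing into complete codons, skipping the leader bases."""
    seq = listing.replace(" ", "")[LEADER:]
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def substitutions(a: str, b: str) -> list[tuple[int, str, str]]:
    """Return (codon number, residue in a, residue in b) where codons differ."""
    found = []
    for number, (x, y) in enumerate(zip(codons(a), codons(b)), start=1):
        if x != y:
            found.append(
                (number, CODON_TO_RESIDUE.get(x, x), CODON_TO_RESIDUE.get(y, y))
            )
    return found


if __name__ == "__main__":
    for number, old, new in substitutions(NATIVE, ANALOGUE):
        print(f"{old} {number} -> {new} {number}")
```

Run on these two listings, the comparison reports differences at codons 17 and 27 only, corresponding to the Cys 17 to Ser 17 and Asp 27 to Ser 27 replacements.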

SEQ ID No 1 

SEQUENCE LENGTH: 62 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC 47 
CTG CTG AAG TGT CTC 62 

SEQ ID No 2 

SEQUENCE LENGTH: 64 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

CTG TTC GAG ACA CTT CAG CAG GAA AGA CTG CGG CAG AGA GCT TGC 45 
TGG ACC CAG TGG AGT ACTG 64 

SEQ ID No 3 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GAA CAG GTA CGT AAA ATT CAA GGC GAT GGT GCG GCT CTG CAG GAA 45
AAG CTG TGC GCA ACC 60

SEQ ID No 4 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TTT GTA GGT TGC GCA CAG CTT TTC CTG CAG AGC CGC ACC ATC GCC 45 
TTG AAT TTT ACG TAC 60 

SEQ ID No 5

SEQUENCE LENGTH: 48 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TAC AAA CTG TGC CAC CCT GAG GAA CTG GTG CTG CTC GGT CAC TCT CTG 48 



SEQ ID No 6

SEQUENCE LENGTH: 51 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CGG GAT CCC CAG AGA GTG ACC GAG CAG CAC CAG TTC CTC AGG GTG 45
GCA CAG 51

SEQ ID No 7

SEQUENCE LENGTH: 63 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

GGG ATC CCG TGG GCT CCA CTG AGC TCT TGC CCG TCC CAA GCT TTA 45
CAA CTG GCA GGC TGC TTG 63

SEQ ID No 8

SEQUENCE LENGTH: 60 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CTG GCT CAA GCA GCC TGC CAG TTG TAA AGC TTG GGA CGG GCA AGA 45
GCT CAG TGG AGC CCA 60

SEQ ID No 9

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

AGC CAG CTG CAC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG 
GCT CTA GAA GGC ATC TCT 

SEQ ID No 10 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TTC AGG AGA GAT GCC TTC TAG AGC CTG CAG CAG ACC CTG GTA CAG 
GAA CAG ACC GGA GTG CAG 

SEQ ID No 11 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CCT GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT GCC 
GAC TTC GCT ACT ACC 

SEQ ID No 12 

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TTG CCA TAT GGT AGT AGC GAA GTC GGC AAC GTC CAG CTG CAG TGT 
GTC CAG GGT GGG CCC CAA 

SEQ ID No 13

SEQUENCE LENGTH: 63 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

ATA TGG CAA CAG ATG GAG GAA CTG GGT ATG GCT CCG GCA CTG CAG 
CCG ACT CAG GGT GCG ATG 

SEQ ID No 14 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TGC TGG CAT CGC ACC CTG AGT CGG CTG CAG TGC CGG AGC CAT ACC 
CAG TTC CTC CAT CTG 

SEQ ID No 15 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CCA GCA TTC GCC TCT GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG 
GTT GCC TCC CAT CTT 

SEQ ID No 16 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

GCT CTG AAG ATG GGA GGC AAC CAG AAC ACC GCC TGC GCG CCG CTG 
GAA AGC AGA GGC GAA 

SEQ ID No 17

SEQUENCE LENGTH: 55 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CAG AGC TTC CTC GAG GTG TCT TAC CGC GTT CTG CGT CAC CTG GCC 
CAG CCG TTAG 

SEQ ID No 18

SEQUENCE LENGTH: 53 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

TCGACTTA CGG CTG GGC CAG GTG ACG CAG AAC GCG GTA AGA CAC CTC
GAG GAA

SEQ ID No 19

SEQUENCE LENGTH: 21 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
TACAACTGGCAGGCTGCTTGA

SEQ ID No 20

SEQUENCE LENGTH: 21 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
GACGTTGCCGACTTCGCTACT

SEQ ID No 21

SEQUENCE LENGTH: 21 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
TGCCGGAGCCATACCCAGTTC

SEQ ID No 22

SEQUENCE LENGTH: 21 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
GCCTGCCAGTTGTAAAGCTTG

SEQ ID No 23 

SEQUENCE LENGTH: 26 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 
GCACCATCGCCTTGAATTTTACGTAG

SEQ ID No 24 

SEQUENCE LENGTH: 62 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC 
CTG CTG AAG TCT CTC 

SEQ ID No 25 

SEQUENCE LENGTH: 64 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CTG TTC GAG AGA CTT CAG CAG GAA AGA CTG CGG CAG AGA GCT TGC 
TGG ACC CAG TGG AGT ACTG

SEQ ID No 26

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT CTG CAG GAA 
AAG CTG TGC GCA ACC 

SEQ ID No 27 

SEQUENCE LENGTH: 60 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

TTT GTA GGT TGC GCA CAG CTT TTC CTG CAG AGC CGC ACC GCT GCC 
TTG AAT TTT ACG TAC 

SEQ ID No 28

SEQUENCE LENGTH: 29 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CTT CAG CAG GAA AGA ACG CGG CAG AGA GC

SEQ ID No 29 

SEQUENCE LENGTH: 33 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

GC TTG GGA AGA GCA AGA GCT CAG AGA AGC CCA C 
SEQ ID No 30 

SEQUENCE LENGTH: 40 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CTG TTG CCA TAT GCT AGA AGC GAA GTC TTC AAC GTC CAG C 

SEQ ID No 31

SEQUENCE LENGTH: 27 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
GCT CAG TGG AGC TTT CGG GAT CCC CAG

SEQ ID No 32

SEQUENCE LENGTH: 27 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
ACG CAG AAC GCG GCG AGA CAC CTC GAG

SEQ ID No 33 

SEQUENCE LENGTH: 29 bases 

SEQUENCE TYPE: Nucleotide 

STRANDEDNESS: Single 

TOPOLOGY: Linear 

G TTC GAG AGA CTT TTC CAG GAA AGA CTG C 

SEQ ID No 34 

SEQUENCE LENGTH: 33 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS : Single 
TOPOLOGY: Linear 

C CTG CAG TTT CGC AGC GCT AGC TTG AAT TTT AC 

SEQ ID No 35

SEQUENCE LENGTH: 37 bases 
SEQUENCE TYPE: Nucleotide 
STRANDEDNESS: Single 
TOPOLOGY: Linear 

CAG AGA GTG AGC GAG CTT CAC CAG TTC CTC AGC GTG G 

SEQ ID No 36

SEQUENCE LENGTH: 29 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

GCT CAG TGG AGC TTT CGG GAT AGC CAG AG

SEQ ID No 37

SEQUENCE LENGTH: 30 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CAG CTT TTC CTG CAG ACG CGC AGC GCT AGC

SEQ ID No 38

SEQUENCE LENGTH: 29 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CC GCT GCC TTG AAT ACG ACG TAC CTG TTC

SEQ ID No 39

SEQUENCE LENGTH: 30 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

GGT TGC GCA CAG ACG TTC CTG CAG AGC CGC

SEQ ID No 40

SEQUENCE LENGTH: 29 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

G GTG GCA CAG ACG GTA GGT TGC GCA CAG C

SEQ ID No 41

SEQUENCE LENGTH: 45 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CG CGG CAG AGA GCT TGC ACG GTA GGT TGG AGC CAT TGTCGATACC

SEQ ID No 42

SEQUENCE LENGTH: 24 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
GCA AGA GCT CAG AGA AGC CCA CGG

SEQ ID No 43

SEQUENCE LENGTH: 39 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CA GCC TGC CAG TTG TAA AGC TTG GGA GCT GCA AGA GCT C

SEQ ID No 44

SEQUENCE LENGTH: 27 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear
GCT CAG AGA AGC TTT CGG GAT CCC CAG

SEQ ID No 45

SEQUENCE LENGTH: 46 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CGG GAT AGC CAG AGA GTG AGC GAG TTT CAC CAG TTC CTC AGC GTG

SEQ ID No 46

SEQUENCE LENGTH: 174/177 Amino acids
SEQUENCE TYPE: Amino acid
TOPOLOGY: Linear

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gln 11
Ser Phe Leu Leu Lys Cys Leu Glu Gln Val Arg 22
Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu 33
Lys Leu (Val Ser Glu)m Cys Ala Thr Tyr Lys Leu 41
Cys His Pro Glu Glu Leu Val Leu Leu Gly His 52
Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser Ser 63
Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys 74
Leu Ser Gln Leu His Ser Gly Leu Phe Leu Tyr 85
Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser 96
Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln 107
Leu Asp Val Ala Asp Phe Ala Thr Thr Ile Trp 118
Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala 129
Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe 140
Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly Val 151
Leu Val Ala Ser His Leu Gln Ser Phe Leu Glu 162
Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln 173
Pro 174
(where m is 0 or 1).



SEQ ID No 47

SEQUENCE LENGTH: 168 + 166 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Double
TOPOLOGY: Linear

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 50
    GACCGT TTATAAGACT TTACTCGACA ACTGTTAATT AGTAGCTTGA 46

AGTTAACTAG TACGCAAGTT CACGTAAAAA GGGTATCGAC 90
TCAATTGATC ATGCGTTCAA GTGCATTTTT CCCATAGCTG 86

AATGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT GCAAGCTTAG 140
TTACCATGGG CCCCTAGGAG ATCTCAGCTG GACGTCCGTA CGTTCGAATC 136

CCCGCCTAAT GAGCGGGCTT TTTTTTAT 168
GGGCGGATTA CTCGCCCGAA AAAAAATAGC 166

SEQ ID No 48

SEQUENCE LENGTH: 534 bases
SEQUENCE TYPE: Nucleotide with corresponding protein
STRANDEDNESS: Single
TOPOLOGY: Linear

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG 50
         Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gln Ser Phe Leu
         1               5                   10

CTG AAG TGT CTC GAA CAG GTA CGT AAA ATT CAA GGC GAT GGT GCG GCT 98
Leu Lys Cys Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala
15                  20                  25                  30

CTG CAG GAA AAG CTG TGC GCA ACC TAC AAA CTG TGC CAC CCT GAG GAA 146
Leu Gln Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu
                35                  40                  45

CTG GTG CTG CTC GGT CAC TCT CTG GGG ATC CCG TGG GCT CCA CTG AGC 194
Leu Val Leu Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser
            50                  55                  60

TCT TGC CCG TCC CAA GCT TTA CAA CTG GCA GGC TGC TTG AGC CAG CTG 242
Ser Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu
        65                  70                  75

CAC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG GCT CTA GAA GGC 290
His Ser Gly Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly
    80                  85                  90

ATC TCT CCT GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT 338
Ile Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val
95                  100                 105                 110

GCC GAC TTC GCT ACT ACC ATA TGG CAA CAG ATG GAG GAA CTG GGT ATG 386
Ala Asp Phe Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met
                115                 120                 125

GCT CCG GCA CTG CAG CCG ACT CAG GGT GCG ATG CCA GCA TTC GCC TCT 434
Ala Pro Ala Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser
            130                 135                 140

GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG GTT GCC TCC CAT CTT CAG 482
Ala Phe Gln Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln
        145                 150                 155

AGC TTC CTC GAG GTG TCT TAC CGC GTT CTG CGT CAC CTG GCC CAG CCG 530
Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro
    160                 165                 170             174

TAA G 534



SEQ ID No 49

SEQUENCE LENGTH: 534 bases
SEQUENCE TYPE: Nucleotide with corresponding protein
STRANDEDNESS: Single
TOPOLOGY: Linear

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG 50
         Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gln Ser Phe Leu
         1               5                   10

CTG AAG TCT CTC GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT 98
Leu Lys Ser Leu Glu Gln Val Arg Lys Ile Gln Gly Ser Gly Ala Ala
15                  20                  25                  30

CTG CAG GAA AAG CTG TGC GCA ACC TAC AAA CTG TGC CAC CCT GAG GAA 146
Leu Gln Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu
                35                  40                  45

CTG GTG CTG CTC GGT CAC TCT CTG GGG ATC CCG TGG GCT CCA CTG AGC 194
Leu Val Leu Leu Gly His Ser Leu Gly Ile Pro Trp Ala Pro Leu Ser
            50                  55                  60

TCT TGC CCG TCC CAA GCT TTA CAA CTG GCA GGC TGC TTG AGC CAG CTG 242
Ser Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln Leu
        65                  70                  75

CAC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG GCT CTA GAA GGC 290
His Ser Gly Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly
    80                  85                  90

ATC TCT CCT GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT 338
Ile Ser Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val
95                  100                 105                 110

GCC GAC TTC GCT ACT ACC ATA TGG CAA CAG ATG GAG GAA CTG GGT ATG 386
Ala Asp Phe Ala Thr Thr Ile Trp Gln Gln Met Glu Glu Leu Gly Met
                115                 120                 125

GCT CCG GCA CTG CAG CCG ACT CAG GGT GCG ATG CCA GCA TTC GCC TCT 434
Ala Pro Ala Leu Gln Pro Thr Gln Gly Ala Met Pro Ala Phe Ala Ser
            130                 135                 140

GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG GTT GCC TCC CAT CTT CAG 482
Ala Phe Gln Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gln
        145                 150                 155

AGC TTC CTC GAG GTG TCT TAC CGC GTT CTG CGT CAC CTG GCC CAG CCG 530
Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro
    160                 165                 170             174

TAA G 534



SEQ ID No 50

SEQUENCE LENGTH: 81 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

GAATTCAACA AAACGGTTGA CAACATGAAG TAAACACGGT ACGATGTACC 50
ACAAGTTCAC GTAAAAAGGG TATCGACAATG 81

SEQ ID No 51

SEQUENCE LENGTH: 67 + 67 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Double
TOPOLOGY: Linear

TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60
    GTAATA TAATGATTAA TTAACCCCTG GGATCTCCAG GGGAAAAAAT AAAATTTTTC 56

CATGCGA 67
GTACGCTTCGA 67

SEQ ID No 52

SEQUENCE LENGTH: 72 + 72 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Double
TOPOLOGY: Linear

TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60
    GTAATA TAATGATTAA TTAACCCCTG GGATCTCCAG GGGAAAAAAT AAAATTTTTC 56

CATGCGGATC CC 72
GTACGCCTAG GGGAAC 72

SEQ ID No 53

SEQUENCE LENGTH: 118 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 50
AGTTAACTAG TACGCAGAGC TCAATCTAGA GGGTATTAAT AATGTTCCCA 100
TTGGAGGATG ATTAAATG 118

SEQ ID No 54

SEQUENCE LENGTH: 35 + 35 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Double
TOPOLOGY: Linear

AGCTCCATAT GGTACCAGAT CTCTCGAGAG TACTT 35
    GGTATA CCATGGTCTA GAGAGCTCTC ATGAAGATC 35

SEQ ID No 55

SEQUENCE LENGTH: 23 + 15 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Double
TOPOLOGY: Linear

AGCTCAGCTG CAGCATATGG TAC
    GTCGAC GTCGTATAC



SEQ ID No 56

SEQUENCE LENGTH: 72 + 72 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Double
TOPOLOGY: Linear

TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60
    GTAATA TAATGATTAA TTAACCCCTG GGATCTCCAG GGGAAAAAAT AAAATTTTTC 56

CATGCGGATC CC 72
GTACGCCTAG GGGAAC 72

SEQ ID No 57

SEQUENCE LENGTH: 84 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

AAT TCA ACA AAA CGG TTG ACA ACA TGA AGT AAA CAC GGT ACG ATG 45
TAC CAC AAG TTC ACG TAA AAA GGG TAT CGA CAA TGG TAC 84

SEQ ID No 58

SEQUENCE LENGTH: 76 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Single
TOPOLOGY: Linear

CAT TGT CGA TAC CCT TTT TAC GTG AAC TTG TGG TAC ATC GTA CCG 45
TGT TTA CTT CAT GTT GTC AAC CGT TTT GTT G 76

SEQ ID No 59

SEQUENCE LENGTH: 24 + 24 bases
SEQUENCE TYPE: Nucleotide
STRANDEDNESS: Double
TOPOLOGY: Linear

AATTCGCATG CGGATCCATC GATC 24
    GCGTAC GCCTAGGTAG CTAGAGCC 24

Claims 

1. A derivative of naturally occurring G-CSF having at least one of the biological properties of naturally
occurring G-CSF and a solution stability (as herein defined) of at least 35% at 5 mg/ml, the said derivative having
at least Cys 17 of the native sequence replaced by a Ser 17 residue and Asp 27 of the native sequence
replaced by a Ser 27 residue.

2. A derivative as claimed in claim 1 having at least one further modification selected from:-
a) Leu 15 of the native sequence replaced by a Glu 15 residue;
b) Lys 23 of the native sequence replaced by an Arg 23 residue;
c) Gly 26 of the native sequence replaced by an Ala 26 residue;
d) Gly 28 of the native sequence replaced by an Ala 28 residue;
e) Ala 30 of the native sequence replaced by a Lys 30 or Arg 30 residue;
f) Lys 34 of the native sequence replaced by an Arg 34 residue;
g) Lys 40 of the native sequence replaced by an Arg 40 residue;
h) Pro 44 of the native sequence replaced by an Ala 44 residue;
i) Leu 49 of the native sequence replaced by a Lys 49 residue;
j) Gly 51 of the native sequence replaced by an Ala 51 residue;
k) Gly 55 of the native sequence replaced by an Ala 55 residue;
l) Trp 58 of the native sequence replaced by a Lys 58 residue;
m) Pro 60 of the native sequence replaced by a Ser 60 residue;
n) Pro 65 of the native sequence replaced by a Ser 65 residue;
o) Pro 111 of the native sequence replaced by a Glu 111 residue;
p) Thr 115 of the native sequence replaced by a Ser 115 residue;
q) Thr 116 of the native sequence replaced by a Ser 116 residue; and
r) Tyr 165 of the native sequence replaced by an Arg 165 residue.

3. A derivative as claimed in claim 1 or claim 2 wherein the further modification comprises at least one of
the following:-
a) Gln 11, Pro 60,65 of the native sequence replaced by Arg 11, Ser 60,65;
b) Ala 111, Thr 115,116 of the native sequence replaced by Glu 111, Ser 115,116;
c) Gln 11, Trp 58, Tyr 165 of the native sequence replaced by Arg 11,165, Lys 58;
d) Leu 15, Gly 26,28, Ala 30 of the native sequence replaced by Glu 15, Ala 26,28, Lys 30;
e) Pro 44, Leu 49, Gly 51,55, Trp 58 of the native sequence replaced by Lys 49,58, Ala 44,51,55;
f) Leu 15, Gly 26,28, Ala 30 of the native sequence replaced by Glu 15, Ala 26,28, Arg 30; or
g) Pro 65 of the native sequence replaced by Ser 65.

4. A derivative as claimed in any one of the preceding claims selected from:-
[Arg11,Ser17,27,60,65]hu G-CSF;

[Glu15,Ser17,27,Ala26,28,Lys30]hu G-CSF;

[Arg11,Glu15,Ser17,27,60,65,Ala26,28,Lys30]hu G-CSF;

[Argiu« Glu« SerW*™** Ala 28 - 28 .Lys^hu G-CSF; 
[Argii^.Seriww.85j hu G-CSF;

[Arg 1 w f Seri7^7,flo^sj hu G-CSF; 

[Glu 1 V 11 i Ser ,7 -2 7 « 60 ^ 5 » 11s * 1 « f Ala 26 ^ 8 Lys^Jhu G-CSF 

[AIai f "mr 3 Jyr* > Aig^ 1 ,Seriw«o.«5jhu G-CSF 

[Glu15,Ser17,27,Ala26,28,Arg30]hu G-CSF
[Ser^^hu G-CSF

[Aig^Sei^^hu G-CSF 

[Serirawjhu G-CSF 

said derivative, if desired having a presequence methionine. 

5. A DNA sequence encoding all or part of the amino acid sequence of a derivative as claimed in any one
of the preceding claims.

6. A recombinant vector containing a DNA sequence as defined in claim 5.

7. A process for the preparation of a recombinant vector as defined in claim 6 which comprises inserting a
DNA sequence as defined in claim 5 into a vector.

8. A procaryotic or eucaryotic host cell stably transformed or transfected with a recombinant vector as defined
in claim 6.

9. A process for the preparation of a procaryotic or eucaryotic host cell as defined in claim 8 which comprises
transforming or transfecting a procaryotic or eucaryotic cell with a recombinant vector as defined in claim
6 whereby to yield a stably transformed or transfected procaryotic or eucaryotic host cell.

10. A process for the preparation of a derivative of naturally occurring G-CSF as defined in any one of claims
1 to 4 which comprises culturing a procaryotic or eucaryotic host cell as defined in claim 8 whereby to obtain
said derivative.

11. A pharmaceutical composition comprising as active ingredient at least one derivative of naturally occurring
G-CSF as claimed in any one of claims 1 to 4 in association with a pharmaceutically acceptable carrier
or excipient.

12. A method for providing haematopoietic therapy to a mammal which comprises administering an effective
amount of a derivative as claimed in any one of claims 1 to 4.

13. A method for arresting the proliferation of leukaemic cells which comprises administering an effective
amount of a derivative as defined in any of claims 1 to 4.

14. A process for extracting a derivative as claimed in any one of claims 1 to 4 from an inclusion body thereof
which comprises 1) suspending said inclusion body in a detergent, 2) oxidation, 3) removal of detergent
and 4) maintaining the solution obtained following removal of detergent at an elevated temperature
whereby to precipitate contaminating bacterial protein, product oligomers and/or degradation products,
whilst retaining said derivative in solution in active form.

15. A process as claimed in claim 14 wherein the derivative to be extracted has a solution stability of at least
85% at 10 mg/ml.

Claims for the following Contracting state: GR 

1. A DNA sequence encoding all or part of the amino acid sequence of a derivative of naturally occurring
G-CSF which derivative has at least one of the biological properties of naturally occurring G-CSF and a
solution stability (as herein defined) of at least 35% at 5 mg/ml, the said derivative having at least Cys 17
of the native sequence replaced by a Ser 17 residue and Asp 27 of the native sequence replaced by a Ser 27
residue.

2. A recombinant vector containing a DNA sequence as defined in claim 1.

3. A process for the preparation of a recombinant vector as defined in claim 2 which comprises inserting a
DNA sequence as defined in claim 1 into a vector.

4. A procaryotic or eucaryotic host cell stably transformed or transfected with a recombinant vector as defined
in claim 2.

5. A process for the preparation of a procaryotic or eucaryotic host cell as defined in claim 4 which comprises
transforming or transfecting a procaryotic or eucaryotic cell with a recombinant vector as defined in claim
2 whereby to yield a stably transformed or transfected procaryotic or eucaryotic host cell.

6. A process for the preparation of a derivative of naturally occurring G-CSF which derivative has at least
one of the biological properties of naturally occurring G-CSF and a solution stability (as herein defined)
of at least 35% at 5 mg/ml, the said derivative having at least Cys 17 of the native sequence replaced by
a Ser 17 residue and Asp 27 of the native sequence replaced by a Ser 27 residue, which process comprises
culturing a procaryotic or eucaryotic host cell as defined in claim 4 whereby to obtain said derivative.

7. A process for extracting from an inclusion body thereof, a derivative of naturally occurring G-CSF having
at least one of the biological properties of naturally occurring G-CSF and a solution stability (as herein
defined) of at least 35% at 5 mg/ml, the said derivative having at least Cys 17 of the native sequence
replaced by a Ser 17 residue and Asp 27 of the native sequence replaced by a Ser 27 residue, which process
comprises
1) suspending said inclusion body in a detergent, 2) oxidation, 3) removal of detergent and 4) maintaining
the solution obtained following removal of detergent at an elevated temperature whereby to precipitate
contaminating bacterial protein, product oligomers and/or degradation products, whilst retaining said
derivative in solution in active form.

8. A process as claimed in claim 7 wherein the derivative to be extracted has a solution stability of at least
85% at 10 mg/ml.

Claims for the following Contracting state: ES 

1. A process for the preparation of a derivative of naturally occurring G-CSF having at least one of the
biological properties of naturally occurring G-CSF and a solution stability (as herein defined) of at least 35% at
5 mg/ml, the said derivative having at least Cys 17 of the native sequence replaced by a Ser 17 residue and
Asp 27 of the native sequence replaced by a Ser 27 residue, which process comprises culturing a procaryotic
or eucaryotic host cell whereby to obtain said derivative, said host cell being a procaryotic or eucaryotic
cell stably transformed or transfected with a recombinant vector in which the recombinant vector comprises
a vector having inserted therein a DNA sequence encoding the said derivative.

2. A process for the preparation of a recombinant vector which comprises inserting into a vector a DNA
sequence encoding all or part of the amino acid sequence of a derivative of naturally occurring G-CSF, which
derivative has at least one of the biological properties of naturally occurring G-CSF and a solution stability
of at least 35% at 5 mg/ml and which further has at least Cys 17 of the native sequence replaced by a Ser 17
residue and Asp 27 of the native sequence replaced by a Ser 27 residue.

3. A process for the preparation of a procaryotic or eucaryotic host cell which comprises transforming or
transfecting a procaryotic or eucaryotic cell with a recombinant vector as defined in claim 2 whereby to yield a
stably transformed or transfected procaryotic or eucaryotic host cell.

4. A process for extracting, from an inclusion body thereof, a derivative of naturally occurring G-CSF having
at least one of the biological properties of naturally occurring G-CSF and a solution stability (as herein
defined) of at least 35% at 5 mg/ml, the said derivative having at least Cys 17 of the native sequence
replaced by a Ser 17 residue and Asp 27 of the native sequence replaced by a Ser 27 residue, which process
comprises
1) suspending said inclusion body in a detergent, 2) oxidation, 3) removal of detergent and 4) maintaining
the solution obtained following removal of detergent at an elevated temperature whereby to precipitate
contaminating bacterial protein, product oligomers and/or degradation products, whilst retaining said
derivative in solution in active form.

5. A process as claimed in claim 4 wherein the derivative to be extracted has a solution stability of at least
85% at 10 mg/ml.

FIGURE 1

EcoRI
AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 50
    GACCGT TTATAAGACT TTACTCGACA ACTGTTAATT AGTAGCTTGA 46

 HpaI
AGTTAACTAG TACGCAAGTT CACGTAAAAA GGGTATCGAC 90
TCAATTGATC ATGCGTTCAA GTGCATTTTT CCCATAGCTG 86

   KpnI     BamHI XbaI    SalI   PstI  SphI
AATGGTACCC GGGGATCCTC TAGAGTCGAC CTGCAGGCAT GCAAGCTTAG 140
TTACCATGGG CCCCTAGGAG ATCTCAGCTG GACGTCCGTA CGTTCGAATC 136

                        ClaI
CCCGCCTAAT GAGCGGGCTT TTTTTTAT 168
GGGCGGATTA CTCGCCCGAA AAAAAATAGC 166
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FIGURE 2 
EcoRI ScaI

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG CTG AAG TGT  59
    GTCA TGA GGT GAC CCA GGT CGT TCG AGA GAC GGC GTC AGA AAG GAC GAC TTC ACA
         Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Cys
         1               5                   10                  15

SnaBI FspI

CTC GAA CAG GTA CGT AAA ATT CAA GGC GAT GGT GCG GCT CTG CAG GAA AAG CTG TGC GCA  119
GAG CTT GTC CAT GCA TTT TAA GTT CCG CTA CCA CGC CGA GAC GTC CTT TTC GAC ACG CGT
Leu Glu Gln Val Arg Lys Ile Gln Gly Asp Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala
        20                  25                  30                  35

MstII BamHI

ACC TAC AAA CTG TGC CAC CCT GAG GAA CTG GTG CTG CTC GGT CAC TCT CTG GGG ATC CCG  179
TGG ATG TTT GAC ACG GTG GGA CTC CTT GAC CAC GAC GAG CCA GTG AGA GAC CCC TAG GGC
Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly Ile Pro
        40                  45                  50                  55

SacI HindIII

TGG GCT CCA CTG AGC TCT TGC CCG TCC CAA GCT TTA CAA CTG GCA GGC TGC TTG AGC CAG  239
ACC CGA GGT GAC TCG AGA ACG GGC AGG GTT CGA AAT GTT GAC CGT CCG ACG AAC TCG GTC
Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln
        60                  65                  70                  75

XbaI

CTG CAC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG GCT CTA GAA GGC ATC TCT CCT  299
GAC GTG AGG CCA GAC AAG GAC ATG GTC CCA GAC GAC GTC CGA GAT CTT CCG TAG AGA GGA
Leu His Ser Gly Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro
        80                  85                  90                  95

NdeI

GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT GCC GAC TTC GCT ACT ACC ATA  359
CTT AAC CCC GGG TGG GAC CTG TGT GAC GTC GAC CTG CAA CGG CTG AAG CGA TGA TGG TAT
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile
        100                 105                 110                 115

TGG CAA CAG ATG GAG GAA CTG GGT ATG GCT CCG GCA CTG CAG CCG ACT CAG GGT GCG ATG  419
ACC GTT GTC TAC CTC CTT GAC CCA TAC CGA GGC CGT GAC GTC GGC TGA GTC CCA CGC TAC
Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln Gly Ala Met
        120                 125                 130                 135

BssHII

CCA GCA TTC GCC TCT GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG GTT GCC TCC CAT CTT  479
GGT CGT AAG CGG AGA CGA AAG GTC GCC GCG CGT CCG CCA CAA GAC CAA CGG AGG GTA GAA
Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu
        140                 145                 150                 155

XhoI SalI

CAG AGC TTC CTC GAG GTG TCT TAC CGC GTT CTG CGT CAC CTG GCC CAG CCG TAA G  534
GTC TCG AAG GAG CTC CAC AGA ATG GCG CAA GAC GCA GTG GAC CGG GTC GGC ATT CAGCT
Gln Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro
        160                 165                 170             174
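
Because the lower strand of Figure 2 is printed 3' to 5' beneath the upper strand, each lower base should be the Watson-Crick complement of the base above it. The following minimal Python sketch, which is not part of the original specification, checks this for the first row of the figure. It assumes the row as read from the figure with doubtful characters resolved by base pairing, and a four-base EcoRI overhang at the left end. UPPER, LOWER, OVERHANG and strands_pair are illustrative names.

```python
# Translation table for Watson-Crick pairing (A-T, C-G).
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# First row of Figure 2 (assumed reading; spaces are kept for legibility only).
UPPER = ("AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG "
         "CAG TCT TTC CTG CTG AAG TGT")
LOWER = ("GTCA TGA GGT GAC CCA GGT CGT TCG AGA GAC GGC "
         "GTC AGA AAG GAC GAC TTC ACA")

# The upper strand carries a 4-base single-stranded EcoRI overhang (AATT).
OVERHANG = 4


def complement(strand):
    """Return the base-by-base complement of a strand, ignoring spaces."""
    return strand.replace(" ", "").translate(COMPLEMENT)


def strands_pair(upper, lower, overhang):
    """True if lower (written 3'->5') pairs with upper beyond the overhang."""
    return complement(upper)[overhang:] == lower.replace(" ", "")


if __name__ == "__main__":
    print(strands_pair(UPPER, LOWER, OVERHANG))
```

The same check can be applied row by row to the remainder of the figure, with an overhang of zero for the internal rows.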
FIGURE 3

EcoRI ScaI

AATTCAGT ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG CTG AAG TCT  59
    GTCA TGA GGT GAC CCA GGT CGT TCG AGA GAC GGC GTC AGA AAG GAC GAC TTC AGA
         Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gln Ser Phe Leu Leu Lys Ser
         1               5                   10                  15

SnaBI FspI

CTC GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT CTG CAG GAA AAG CTG TGC GCA  119
GAG CTT GTC CAT GCA TTT TAA GTT CCG TCG CCA CGC CGA GAC GTC CTT TTC GAC ACG CGT
Leu Glu Gln Val Arg Lys Ile Gln Gly Ser Gly Ala Ala Leu Gln Glu Lys Leu Cys Ala
        20                  25                  30                  35

MstII BamHI

ACC TAC AAA CTG TGC CAC CCT GAG GAA CTG GTG CTG CTC GGT CAC TCT CTG GGG ATC CCG  179
TGG ATG TTT GAC ACG GTG GGA CTC CTT GAC CAC GAC GAG CCA GTG AGA GAC CCC TAG GGC
Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His Ser Leu Gly Ile Pro
        40                  45                  50                  55

SacI HindIII

TGG GCT CCA CTG AGC TCT TGC CCG TCC CAA GCT TTA CAA CTG GCA GGC TGC TTG AGC CAG  239
ACC CGA GGT GAC TCG AGA ACG GGC AGG GTT CGA AAT GTT GAC CGT CCG ACG AAC TCG GTC
Trp Ala Pro Leu Ser Ser Cys Pro Ser Gln Ala Leu Gln Leu Ala Gly Cys Leu Ser Gln
        60                  65                  70                  75

XbaI

CTG CAC TCC GGT CTG TTC CTG TAC CAG GGT CTG CTG CAG GCT CTA GAA GGC ATC TCT CCT  299
GAC GTG AGG CCA GAC AAG GAC ATG GTC CCA GAC GAC GTC CGA GAT CTT CCG TAG AGA GGA
Leu His Ser Gly Leu Phe Leu Tyr Gln Gly Leu Leu Gln Ala Leu Glu Gly Ile Ser Pro
        80                  85                  90                  95

NdeI

GAA TTG GGG CCC ACC CTG GAC ACA CTG CAG CTG GAC GTT GCC GAC TTC GCT ACT ACC ATA  359
CTT AAC CCC GGG TGG GAC CTG TGT GAC GTC GAC CTG CAA CGG CTG AAG CGA TGA TGG TAT
Glu Leu Gly Pro Thr Leu Asp Thr Leu Gln Leu Asp Val Ala Asp Phe Ala Thr Thr Ile
        100                 105                 110                 115

TGG CAA CAG ATG GAG GAA CTG GGT ATG GCT CCG GCA CTG CAG CCG ACT CAG GGT GCG ATG  419
ACC GTT GTC TAC CTC CTT GAC CCA TAC CGA GGC CGT GAC GTC GGC TGA GTC CCA CGC TAC
Trp Gln Gln Met Glu Glu Leu Gly Met Ala Pro Ala Leu Gln Pro Thr Gln Gly Ala Met
        120                 125                 130                 135

BssHII

CCA GCA TTC GCC TCT GCT TTC CAG CGG CGC GCA GGC GGT GTT CTG GTT GCC TCC CAT CTT  479
GGT CGT AAG CGG AGA CGA AAG GTC GCC GCG CGT CCG CCA CAA GAC CAA CGG AGG GTA GAA
Pro Ala Phe Ala Ser Ala Phe Gln Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu
        140                 145                 150                 155

XhoI SalI

CAG AGC TTC CTC GAG GTG TCT TAC CGC GTT CTG CGT CAC CTG GCC CAG CCG TAA G  534
GTC TCG AAG GAG CTC CAC AGA ATG GCG CAA GAC GCA GTG GAC CGG GTC GGC ATT CAGCT
Gln Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gln Pro
        160                 165                 170             174
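
Figure 3 differs from Figure 2 at the codons for residues 17 and 27, which correspond to the Cys 17 to Ser 17 and Asp 27 to Ser 27 replacements recited in the claims. The following minimal Python sketch, which is not part of the original specification, translates codons 1 to 37 of each figure with the standard genetic code and lists the positions that differ. The codon strings are an assumed reading of the two figures, and CODON_TABLE, FIGURE_2, FIGURE_3, translate and substitutions are illustrative names.

```python
from itertools import product

# Standard genetic code, enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Codons 1-37 of the upper strand (assumed reading of Figures 2 and 3).
FIGURE_2 = (
    "ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG CTG AAG TGT "
    "CTC GAA CAG GTA CGT AAA ATT CAA GGC GAT GGT GCG GCT CTG CAG GAA AAG "
    "CTG TGC GCA"
)
FIGURE_3 = (
    "ACT CCA CTG GGT CCA GCA AGC TCT CTG CCG CAG TCT TTC CTG CTG AAG TCT "
    "CTC GAA CAG GTA CGT AAA ATT CAA GGC AGC GGT GCG GCT CTG CAG GAA AAG "
    "CTG TGC GCA"
)


def translate(codons):
    """Translate space-separated codons into one-letter amino acid codes."""
    return "".join(CODON_TABLE[codon] for codon in codons.split())


def substitutions(reference, variant):
    """List (position, reference residue, variant residue) where they differ."""
    pairs = zip(translate(reference), translate(variant))
    return [(pos, ref, var)
            for pos, (ref, var) in enumerate(pairs, start=1) if ref != var]


if __name__ == "__main__":
    for position, ref, var in substitutions(FIGURE_2, FIGURE_3):
        print(f"residue {position}: {ref} -> {var}")
```

Only the first two rows of each figure are compared here; the remaining rows read identically in both figures.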
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FIGURE 4 



TRANSCRIPTION TERMINATION SEQUENCE

a)

SalI

5' TCGACATTATATTACTAATTAATTGGGGAC...
3'     GTAATATAATGATTAATTAACCCCTG...

SphI HindIII
AAAGCATGCA     3'
TTTCGTACGTTCGA 5'

b)

SalI

5' TCGACATTATATTACTAATTAATTGGGGACCCTAGAGGTC
3'     GTAATATAATGATTAATTAACCCCTGGGATCTCCAGGGGAAAAAATAAAATT

SphI BamHI StyI
AAAGCATGCGGATCCC     3'
TTTCGTACGCCTAGGGGAAC 5'
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FIGURE 5 
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FIGURE 6 
EcoRI 

AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT 
HpaI

AGTTAACTAG TACGCAGAGC TCAATCTAGA GGGTATTAAT AATGTTCCCA 
TTGGAGGATG ATTAATG 






FIGURE 7 
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FIGURE 8 
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FIGURE 9 
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FIGURE 10 
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FIGURE 11 
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FIGURE 12 







Europäisches Patentamt
European Patent Office
Office européen des brevets

(11) Publication number: 0 459 630 A3

EUROPEAN PATENT APPLICATION

(21) Application number: 91303868.3
(22) Date of filing: 29.04.91

(51) Int. Cl. 5: C12N 15/27, C12P 21/02, C07K 15/00, A61K 37/02

(30) Priority: 30.04.90 GB 9009623
20.06.90 GB 9013773
24.07.90 GB 9016215
11.02.91 GB 9102799

(43) Date of publication of application: 04.12.91 Bulletin 91/49

(84) Designated Contracting States: AT BE CH DE DK ES FR GB GR IT LI LU NL SE

(88) Date of deferred publication of search report: 21.10.92 Bulletin 92/43

(71) Applicant: IMPERIAL CHEMICAL INDUSTRIES PLC
Imperial Chemical House, Millbank
London SW1P 3JF (GB)

(72) Inventor: Camble, Roger
8 Yew Tree Close
Macclesfield, Cheshire SK11 8NY (GB)
Inventor: Wilkinson, Anthony James
4 Ravenho Lane
Macclesfield, Cheshire (GB)
Inventor: Carr, Heather
16 Princess Street
Bollington, Cheshire (GB)
Inventor: Timms, David
45 Beech Lane
Macclesfield, Cheshire (GB)

(74) Representative: Mack, John Richard et al
ICI Group Patents Services Dept PO Box 6
Shire Park Bessemer Road
Welwyn Garden City Herts, AL7 1HD (GB)

(54) Polypeptides.

(57) Derivatives of naturally occurring G-CSF having at least one of the biological properties of naturally
occurring G-CSF, and a solution stability of at least 35% at 5 mg/ml are disclosed in which the derivative
has at least Cys 17 of the native sequence replaced by a Ser 17 residue and Asp 27 of the native sequence
replaced by a Ser 27 residue.

Nucleotide sequences coding for part or all of the amino acid sequence of the derivatives of the
invention may be incorporated into autonomously replicating plasmid or viral vectors employed to
transform or transfect suitable procaryotic or eucaryotic host cells such as bacteria, yeast or vertebrate
cells in culture.
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